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I. REAL PARTY IN INTEREST 

The real party in interest is E.I. du Pont de Nemours and Company 
(ftere/natfer "DuPont"), owner of the Application. 

II. RELATED APPEALS AND INTERFERENCES 

There are no other appeals or interferences known to Applicants, Applicants' 
legal representative, or DuPont that will directly affect or be directly affected by or 
have a bearing on the Board of Patent Appeals and Interferences' (hereinafter the 
"Board") decision in the present Appeal. 

III. STATUS OF THE CLAIMS 

Claims 26-29 stand rejected and are the subject of this Appeal. Originally- 
filed Claims 1-25 have been canceled. 

IV. STATUS OF AMENDMENTS 

No amendments were made to the claims in response to the Final Office 

Action. 

V. SUMMARY OF CLAIMED SUBJECT MATTER 

Claim 26, the only independent claim at issue, relates to an isolated 
polynucleotide comprising: (a) a nucleotide sequence encoding a polypeptide having 
brittle-1 activity (see, e.g., Applicants' Specification at page 6, lines 17-21), wherein 
the polypeptide has an amino acid sequence of at least 90% sequence identity when 
compared to SEQ ID NO: 18 (see, e.g., Applicants' Specification at page 8, line 34 - 
page 9, line 17; page 14, lines 16-36), or (b) a full-length complement of the 
nucleotide sequence of (a) (see, e.g., Applicants 1 Specification at page 8, lines 19- 
33; page 12, lines 3-7). 

VL GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

Whether claims 26-29 are supported by sufficient written description under 35 
U.S.C. § 112, 1 st Paragraph. 

Whether claims 26-29 are enabled under 35 U.S.C. § 112, 1 st Paragraph. 
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VII. ARGUMENT 

A. Claims 26-29 Comply with the Written Description Requirement of 35 
U.S.C. § 112, 1 st Paragraph. 

Claims 26-29 stand rejected under 35 USC 112, first paragraph, as failing to 
comply with the written description requirement. During the course of prosecution, 
the Examiner asserted that "the specification only describes a coding sequence from 
wheat that comprises SEQ ID NO: 17. Applicant does not describe other nucleic 
acids encompassed by the claims, and the structural and functional features that 
distinguish all such nucleic acids from other nucleic acids are not provided." June 
20, 2005, Non-Final Office Action, at 6 (hereinafter "Non-Final OA"). The Examiner 
further asserted that "[n]o description is provided as to the function of the encoded 
protein." Id.] see also March 8, 2006, Final Office Action, at 8 (hereinafter "First 
Final OA"). The Examiner thus concluded that "Applicant has not, in fact, described 
nucleic acids that encode a protein with 90% identity to SEQ ID NO: 18 within the full 
scope of the claims, and the specification fails to provide an adequate written 
description of the claimed invention." Non-Final OA, at 6; see also First Final OA, at 
9. 

Applicants' claimed invention, however, substantially conforms to Example 14 
of the "Synopsis of Application of Written Description Guidelines", 66 Fed. Reg. 1099 
(Jan. 5, 2001), available at http://www.uspto.gov/web/menu/written.pdf (last visited 
Oct. 5, 2007) (hereinafter "Written Description Guidelines"). In Example 14, the 
exemplary claim is directed to "A protein having SEQ ID NO:3 and variants thereof 
that are at least 95% identical to SEQ ID NO:3 and catalyze the reaction of A^B." 
Written Description Guidelines, at 53. Included in the Example 14 specification is an 
"indications] that procedures for making proteins with substitutions, deletions, 
insertions and additions is routine in the art and ... an assay for detecting the 
catalytic activity of the protein." Id. Under the "Analysis" section of Example 14, the 
requirements of 95% identity to SEQ ID NO:3 and having catalytic activity "are 
essential to the operation of the claimed invention." Id. The procedures of making 
and testing sequences having 95% identity to SEQ ID NO:3 are determined to be 
"conventional." Id. Example 14 concludes that 



4 



Serial No. 10/659,199 
Docket No. BB1157 US CNT 



[t]he single species disclosed is representative of the genus because 
all members have at least 95% structural identity with the reference 
compound and because of the presence of an assay which applicant 
provided for identifying all of the at least 95% identical variants of SEQ 
ID NO:3 which are capable of the specified catalytic activity. One of 
skill in the art would conclude that applicant was in possession of the 
necessary common attributes possessed by the members of the 
genus. 

Id. at 54-55. 

Applicants 7 claimed invention, though directed to the nucleotide sequences 
encoding the proteins having brittle-1 activity, is structured similarly to that of the 
Example 14 claim. The claimed nucleotide sequences encode proteins having 90% 
identity to SEQ ID NO: 18, with the encoded proteins having brittle-1 activity. Like 
Example 14, there is not substantial variation in the encoded proteins, because the 
entire genus must have 90% sequence identity to SEQ ID NO: 18 and have brittle-1 
activity. Procedures for producing proteins having 90% identity to SEQ ID NO: 18 are 
well-known in the art. For example, alterations in a nucleic acid fragment that result 
in the production of a chemically equivalent amino acid at a given site, but do not 
affect the functional properties of the encoded polypeptide, are well known in the art, 
and such alterations can be obtained by, for example, site directed mutagenesis as 
described in Sambrook et al. t Molecular Cloning: A Laboratory Manual] Cold Spring 
Harbor Laboratory Press: Cold Spring Harbor, 1989 (cited in Applicants' 
Specification at page 14, lines 9-12). Thus, a codon for the amino acid alanine, a 
hydrophobic amino acid, may be substituted by a codon encoding another less 
hydrophobic residue, such as glycine, or a more hydrophobic residue, such as 
valine, leucine, or isoleucine. Similarly, changes that result in substitution of one 
negatively charged residue for another, such as aspartic acid for glutamic acid, or 
one positively charged residue for another, such as lysine for arginine, can also be 
expected to produce a functionally equivalent product. Nucleotide changes which 
result in alteration of the N-terminal and C-terminal portions of the polypeptide 
molecule would also not be expected to alter the activity of the polypeptide. See, 
e.g., Applicants' Specification, page 7, line 29 - page 8, line 9. Alterations in nucleic 
acid fragments can also be introduced by error-prone polymerase chain reaction 
(PCR), a technique that is well known by those skilled in the art. See, e.g., U.S. 
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Patent Nos. 4,683,195 and 4,800,159; see also You L. & Arnold F.H., Protein Eng. 
9:77-83 (1996). Another technique for obtaining homologous proteins is directed 
molecular evolution. Unlike protein engineering, in which proteins are improved by 
making specific changes to them, directed evolution involved mutating genes in vitro 
and screening the resulting proteins for desired activity. See Crameri A. et a/., Nat. 
Biotech. 15:436-38 (1997). Other techniques are well known in the art, but 
Applicants refrain from providing such information as it would be merely cumulative 
of the information provided above. 

Further, Applicants provided a brittle-1 assay from Shannon et a/., Plant 
Physiol. 1 17:1235-52 (1998), used to identify the proteins having 90% sequence 
identity to SEQ ID NO: 18 that also have brittle-1 activity. The assay in Shannon et 
ai consists of measuring [ 14 C]Glucose uptake from ADP-Glucose (ADP-GIc) and 
incorporation into methanol- and water-insoluble products as described on page 
1239 and Table V of Shannon etal. In short, amyloplasts isolated from endosperm 
tissue are added to a reaction mixture (200 pi, final volume) containing 100 mM 
Bicine, pH 8.5, 0.5 M sorbitol, 12.5 mM EDTA, 10 mM GSH, 50 mM potassium 
acetate, and 4 mM [ 14 C]ADP-Glc plus 60 or 80 pi of amyloplast fraction. Amyloplasts 
are isolated from endosperm tissue of kernels aged between 10 to 14 days past 
pollination. 

It is well-established that an applicant need not disclose that which is known 
in the art. In re Buchner, 929 F.2d 660, 661, 18 USPQ2d 1331, 1332 (Fed. Cir. 
1991). Applicants' citation of this reference in the specification should thus be 
sufficient evidence of written description of the assay. 

Applicants thus respectfully submit, in accordance with Example 14 of the 
Written Description Guidelines, that the claimed invention is supported by sufficient 
written description in Applicants' Specification. 

The Board's recent decision in Ex parte Kubin, Appeal 2007-0819 (BPAI May 
31, 2007), available at http://www.uspto.gov/web/offices/dcom/bpai/prec/ 
fd070819.pdf (last visited Oct. 9, 2007), is distinguishable. In Kubin, the claim at 
issue was directed to "An isolated nucleic acid molecule comprising a polynucleotide 
encoding a polypeptide at least 80% identical to amino acids 22-221 of SEQ ID 
NO:2, wherein the polypeptide binds CD48." Kubin, Appeal 2007-0819, at 3. The 
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Kubin specification "does not disclose any variants in which the nucleotide sequence 
encoding amino acids 22-221 of SEQ ID NO:2 is varied." Id. at 13 (emphasis 
added). Further, the Board noted that there was no disclosure of "correlation 
between function (binding to CD48) and structure responsible for binding to CD48 
(other than the entire extracellular domain) such that the skilled artisan would have 
known what modifications could be made . . . without losing function." Id. In light of 
these facts, the Board concluded that "[possession may not be shown by merely 
describing how to obtain possession of members of the claimed genus or how to 
identify their common structural features." Id. at 16 (citing Univ. of Rochester v. GD 
Searle & Co., 358 F.3d 916, 927, 69 USPQ2d 1886, 1895 (Fed. Cir. 2004)). Of 
particular concern was Appellants failure to "describe! ] what domains of [SEQ ID 
NO:2] are correlated with the required binding to CD48, and thus [Appellants] have 
not described which . . . amino acids can be varied and still maintain binding." Id. 

Applicants' Specification, however, provides sufficient guidance as to what 
amino acids could be modified without affecting brittle-1 activity. While the Examiner 
has focused on lack of disclosure of sequences having 90% plus identity to SEQ ID 
NO: 18, Figure 1 of Applicants' Specification provides a sequence comparison 
between SEQ ID NO:18 and a known brittle-1 protein (SEQ ID NO:21) that has only 
57.3% identity to the claimed sequence, which provides a clear picture of regions of 
brittle-1 proteins that have high homology, and are thus likely more susceptible to 
modification, and regions having low or no homology where more modifications can 
be made. For example, at the N-termini of these sequences, ten of the first 1 1 
amino acids are identical, with the lone difference being a conservative amino acid 
substitution of the valine at amino acid seven of SEQ ID NO:21 to an alanine in SEQ 
ID NO: 18. See, e.g., Wu T.D. & Brutlag D.L., Proc. Int. Conf. Intell. Syst. Mol. Biol. 
4:230-40 (1996). To the skilled artisan, the significant sequence identity at the N- 
terminus indicates that little or no sequence substitution should be made there, and if 
made that conservative substitutions would be preferred, in order to maintain brittle-1 
activity. Another example of high homology is amino acids 137-219 of SEQ ID 
NO:21 and amino acids 125-207 of SEQ ID NO: 18. Of these 83 amino acids, only 
six are different. Four of these substitutions are conservative (two glutamines to 
arginines, an asparagine to threonine, and a phenylalanine to tyrosine), while two 
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are non-conservative (an isoleucine to serine and a threonine to proline). Other 
regions of high homology, for example amino acids 228-418 of SEQ ID NO:21 and 
amino acids 216-404 of SEQ ID NO: 18, provide further guidance as to where and 
what type of substitution could be made. 

By contrast, the C-termini of these proteins are significantly different. After 
amino acid 404 of SEQ ID NO:18 and amino acid 418 of SEQ ID NO:21, not only 
does SEQ ID NO:18 contain 1 1 additional amino acids compared to SEQ ID NO:21, 
but there is also very little sequence homology between the two sequences. Thus, 
the skilled artisan could expect that amino acid substitutions, deletions, and/or 
additions in this region would have little effect on brittle-1 activity as compared to, for 
example, the N-terminal regional. Another region of low sequence homology can be 
found at amino acids 54-136 of SEQ ID NO:21 and amino acids 61-124 of SEQ ID 
NO: 18. Similar to the C-termini of SEQ ID NOs: 18 and 21, there is a significant 
difference in amino acid count in this region (83 amino acids for SEQ ID NO:21 and 
64 amino acids for SEQ ID NO: 18). The skilled artisan could thus conclude that this 
region of brittle-1 proteins can have significant amino acid substitutions, deletions, 
and/or substitutions yet still retain brittle-1 activity. Applicants thus submit that the 
specification provides sufficient guidance as to what regions of SEQ ID NO: 18 could 
be modified and in what way to produce a protein having (1) at least 90% identity to 
SEQ ID NO:18 and (2) have brittle-1 activity. 

Consequently, Kubin does not compel a result of lack of written description 
here and, indeed, should support Applicants' assertion of adequate written 
description for the claimed invention because Applicants have described a 
correlation between structure and function for SEQ ID NO:18. Cf. Kubin, Appeal 
2007-0819, at 17 ("Without a correlation between structure and function, 
[Appellants'] claim does little more than define the claimed invention by function."). 

Similarly, the cases cited by the Examiner in the Non-Final OA in support of 
the written description rejection do not compel a result of insufficient written 
description. The Examiner cites Regents of the University of California v. Eli Lilly & 
Co., 119 F.3d 1559, 43 USPQ2d 1398 (Fed. Cir. 1997), for the principle that w [t]he 
name cDNA is not itself a written description of that DNA; it conveys no 
distinguishing information concerning its identity." Non-Final OA, at 6 (quoting Eli 
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Lilly, 119 F.3d at 1567, 43 USPQ2d at 1405). In Eli Lilly, the patentee described one 

mammalian insulin cDNA (and thus one vertebrate insulin cDNA), rat insulin cDNA. 

Eli Lilly, 119 F.2d at 1568, 43 USPQ2d at 1405. One generic claim at issue in Eli 

Lilly covered all mammalian insulin cDNAs, with no limitation as to identity to the 

disclosed rat sequence. Id. at 1563, 1401. The Federal Circuit thus concluded, as 

quoted by the Examiner, that 

a generic statement such as Vertebrate insulin cDNA' or 'mammalian 
insulin cDNA,' without more, is not an adequate written description of 
the genus because it does not distinguish the claimed genus from 
others, except by function. It does not specifically define any of the 
genes that fall within its definition. It does not define any structural 
features commonly possessed by members of the genus that 
distinguish them from others. One skilled in the art therefore cannot as 
one can do with a fully described genus, visualize or recognize the 
identity of the members of the genus. A definition by function, as we 
have previously indicated, does not suffice to define the genus 
because it is only an indication of what the gene does, rather than what 
it is. 

Non-Final OA, at 6 (quoting Eli Lilly, 119 F.3d at 1568, 43 USPQ2d at 1406). Here, 
Applicants claimed invention is distinguished from other "genera" of brittle-1 proteins 
by claimed identity to SEQ ID NO:18 . All genes covered by the claimed genus are 
instantly recognizable from those not covered by the claim. The claimed genus is 
not defined solely by function but, as described above, is defined by the structure of 
SEQ ID NO:18 and its relationship to known brittle-1 proteins. The Examiner's 
reliance on Eli Lilly is thus misplaced. 

The other case that the Examiner cited in the Non-Final OA was Amgen, Inc. 
v. ChugaiPharm. Co., 927 F.2d 1200,18 USPQ2d 1016 (Fed. Cir. 1991), for the 
principle that "[i]t is not sufficient to define [a gene] solely by its principal biological 
property, e.g., encoding human erythropoietin, because an alleged conception 
having no more specificity than that is simply a wish to know the identity of any 
material with that biological property." Non-Final OA, at 7 (quoting Amgen, 927 F.2d 
at 1206, 18 USPQ2d at 1021). While Applicants do not question the propriety of this 
statement, it relates to conception, not written description. Applicants believe that 
the skilled artisan would not question that Applicants conceived the claim 26 
invention. In the statement following the Examiner's quote from Amgen, the Federal 
Circuit held that "when an inventor is unable to envision the detailed constitution of a 
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gene so as to distinguish it from other materials, as well as a method for obtaining it, 
conception has not been achieved until reduction to practice has occurred, i.e., until 
after the gene has been isolated." Amgen, 927 F.2d at 1206, 18 USPQ2d at 1021. 
Under this test, Applicants have (1) detailed the constitution of SEQ ID NO: 17 and its 
variants that encode polypeptides having at least 90% identity to SEQ ID NO:18 so 
as to distinguish these brittle-1 genes from the prior art, (2) have detailed numerous 
methods for obtaining the claimed genes (see, e.g., page 15, line 11- page 16, line 
22), and (3) in any event have isolated SEQ ID NO: 17 from maize (that is, reduced it 
to practice). Thus, Applicants conceived the claimed invention. 

Other recent Federal Circuit cases support Applicants' position that the claims 
are supported by sufficient written description. In Invitrogen Corp. v. Clontech Labs., 
Inc., 429 F.3d 1052, 77 USPQ2d 1161 (Fed. Cir. 2005), the claimed invention related 
to modified reverse transcriptases (RTs) having DNA polymerase activity and 
reduced RNase H activity, wherein the RTs were encoded by nucleotide sequences 
derived from a retrovirus, yeast, Neurospora, Drosophila, primates, or rodents. 
Invitrogen, 429 F.3d at 1072, 77 USPQ2d at 1 174. The unmodified RTs from this list 
of organisms were known, and the specification provided test data for the one 
disclosed modified RT (that is, the only disclosed sequence actually covered by the 
claim). Id. In affirming the district court's finding of sufficient written description, the 
Federal Circuit distinguished Eli Lilly and Fiers v. Revel, 984 F.2d 1164, 25 USPQ2d 
1601 (Fed. Cir. 1993), noting that "[i]ln those cases, the patent specifications at issue 
did not identify the sequence (structure) of any embodiment of DNA claimed therein." 
Invitrogen, 429 F.3d at 1073, 77 USPQ2d at 1 175-76. Here, Applicants claimed 
genus is of a similar, and probably smaller, scope as that in Invitrogen.^ Applicants' 
Specification discloses SEQ ID NO: 18 (equivalent to the known RT sequences), and 
Applicants claim a genus having identity thereto in combination with brittle-1 function. 
While Applicants did not provide test data, the lack of working examples should not 
affect the written description analysis. See Falkner v. Inglis, 448 F.3d 1357, 1366, 
79 USPQ2d 1001, 1007 (Fed. Cir. 2006) (confirming that " examples are not 



1 The scope of the Invitrogen genus covers any modification to any RT from any of the claimed 
organisms, so long as the RT has DNA polymerase activity and reduced RNase H activity. 
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necessary to support the adequacy of a written description . . ." (emphasis in 
original)). 

In In re Wallach, 378 F.3d 1330, 71 USPQ2d 1939 (Fed. Cir. 2004), the 

applicants there disclosed actual possession of a tumor necrosis factor binding 

protein II ("TBP-II"). Wallach, 378 F.3d at 1331, 71 USPQ2d at 1940. From the 

isolated TBP-II protein, the applicants obtained the N-terminal portion (the first ten 

amino acids) of the isolated protein. Id. The applicants demonstrated that the 

isolated TBP-II inhibited the cytotoxic effect of tumor necrosis factor. Id. From this 

disclosure, the applicants claimed isolated polynucleotides encoding TBP-II. Id. at 

1331-32, 1940. In finding lack of written description for the polynucleotide claims, 

the Federal Circuit first noted that 

the complete amino acid sequence of a protein may put one in 
possession of the genus of DNA sequences encoding it, and that one 
of ordinary skill in the art . . . may have therefore been in possession of 
the entire genus of DNA sequences that can encode the disclosed 
partial protein sequence, even if individual species within that genus 
might not have been described or rendered obvious. 

Id. at 1333, 1942. The Wallach applicants failed this test because, while they may 

have had actual possession of TBP-II, "possession of the protein says nothing about 

whether they were in possession of the protein's amino acid sequence." Id. at 1334, 

1943. Applicants here, however, demonstrated possession of the amino acid 

sequences covered encoded by the claimed polynucleotides, and, as noted in 

Wallach, Applicants should not be required to "list every possible permutation of the 

nucleic acid sequences that can encode a particular protein for which the amino acid 

sequence is disclosed. . . ." Id. at 1334, 1942. 

As noted in Capon v. Eshhar, 418 F.3d 1349, 76 USPQ2d 1078 (Fed. Cir. 

2005) "[precedent distinguishes among generic inventions that are adequately 

supported, those that are merely a 'wish 1 or 'plan,' the words of Fiers v. Revel, 984 

F.2d at 1171, and those in between, as illustrated by Noelle v. Lederman, 355 F.3d 

[1343, 1350, 69 USPQ2d 1508 (Fed. Cir. 2004)]; the facts of the specific case must 

be evaluated." Capon, 418 F.3d at 1360, 76 USPQ2d at 1086. In Fiers, one party in 

an interference claimed that disclosure of how a skilled artisan could, by reverse 

transcription, obtain an undisclosed DNA sequence was sufficient to support written 
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description of the count. 2 F/ers, 984 F.2d at 1170, 25 USPQ2d at 1605-06. The 
Board disagreed, and the Federal Circuit affirmed, holding that "[a]n adequate written 
description of a DNA requires more than a mere statement that it is part of the 
invention and reference to a potential method for isolating it; what is required is a 
description of the DNA itself." Id. The specification at issue "just represented] a 
wish, or arguably a plan, for obtaining the DNA." Id. at 1171, 1606. Conversely, 
Applicants' claimed invention cannot be considered a mere wish or plan, because 
Applicants disclosed the sequence of wheat brittle-1 protein, SEQ ID NO: 18, along 
with the actual nucleotide sequence that encodes the protein, SEQ ID NO: 17. 
Applicants thus provided a description of the DNA itself. In A/oe//e, the patentee 
claimed human CD40CR antibodies on the basis of a American Type Culture 
Collection deposit of a hybridoma that secreted the mouse form of the antibody. 
A/oe//e, 355 F.3d at 1349, 69 USPQ2d at 1514. The patentee failed to disclose any 
structural elements of the mouse antibody and failed to disclose a fully characterized 
antigen. Id. The Federal Circuit found this lack of disclosure fatal to the patentee's 
asserted written description sufficiency, holding that no structural elements for the 
antibody were disclosed in Noelle specification. Id. at 1349-50, 1514 (noting that the 
patentee "attempted to define an unknown by its binding affinity to another 
unknown"). Though not explicitly stated in Noelle, the court implied that deposit of 
the hybridoma was sufficient disclosure of the mouse antibody in accordance with 
Enzo Biochem, Inc. v. Gen-Probe Inc., 323 F.3d 956, 964, 63 USPQ2d 1609, 1613 
(Fed. Cir. 2002). Noelle, 355 F.3d at 1349, 69 USPQ2d at 1514. Unlike Noelle's "in 
between" position on the written description spectrum, Applicants' Specification 
includes description of brittle-1 structural elements, that is, the actual wheat 
sequence set forth in SED ID NO: 18. Because Applicants' Specification contains 
structural disclosure beyond that of Fiers (only a wish or plan there) and Noelle 
(description of mouse antibody only through deposit of hybridoma producing that 
antibody), Applicants respectfully submit that their Specification describes the 
claimed genus of brittle-1 -encoding polynucleotides. 



2 The count read: "A DNA which consists essentially of a DNA which codes for a human fibroblast 
interferon-beta polypeptide." F/ers, 984 F.2d at 1166, 25 USPQ2d at 1603. 

12 



Serial No. 10/659,199 
Docket No. BB1 157 US CNT 

In light of the above arguments, Applicants respectfully request withdrawal of 
the rejections of claims 26 and 30-40 under 35 U.S.C. § 1 12, first paragraph, written 
description. 

B. Claims 26-29 are Enabled Under 35 U.S.C. § 112, 1 st Paragraph. 

Claims 26-29 stand rejected under 35 USC 112, first paragraph, because the 
Applicants 1 Specification while being enabling for nucleic acids encoding a SEQ ID 
NO: 18 and constructs and vectors comprising them, allegedly does not reasonably 
provide enablement for nucleic acids encoding a protein with 90% identity to SEQ ID 
NO: 18 and constructs and vectors comprising them. During prosecution, the 
Examiner asserted that "[t]he instant specification fails to provide guidance for how to 
make or isolate nucleic acids encoding proteins with 90% identity to SEQ ID 
NO:18 — specific hybridization or PCR conditions, probes or primers are not recited." 
Non-Final OA, at 3. Further, the Examiner asserted that "[t]he instant specification 
fails to teach essential regions of the encoded protein." Id. at 3-4. The Examiner 
also stated that 

[t]he instant specification fails to provide guidance for which amino 
acids of SEQ ID NO: 18 can be altered and to which other amino acids, 
and which amino acids must not be changed, to maintain adenylate 
translocator activity of the encoded protein. The specification also fails 
to provide guidance for which amino acids can be deleted and which 
regions of the protein can tolerate insertions and still produce a 
functional enzyme. 

Id. at 4. The Examiner thus concluded that, because Applicants' Specification 
allegedly does not describe transformation of plants with a gene encoding a protein 
having 90% identity to SEQ ID NO: 18, "undue trial and error experimentation would 
be required to screen through the myriad of nucleic acids encompassed by the 
claims and plants transformed therewith, to identify those with altered starch, if such 
plants are even obtainable." Id. at 5. 

Applicants agree with the Examiner that a specification must enable one of 
ordinary skill in the art to make and use the claimed invention without undue 
experimentation. Applicants respectfully submit, however, that the Examiner's 
conclusion of nonenablement of sequences having 90% identity to SEQ ID NO: 18 is 
erroneous because any experimentation needed to practice the present invention 
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would be routine. "[A] patent specification complies with the statute even if a 
'reasonable' amount of routine experimentation is required in order to practice a 
claimed invention, but that such experimentation must not be 'undue."' Enzo 
Biochem, Inc. v. Calgene, Inc., 188 F.3d 1362, 1371, 52 USPQ2d 1129, 1135 (Fed. 
Cir. 1999). Factors to consider when deciding whether experimentation is undue 
include: "(1) the quantity of experimentation necessary, (2) the amount of direction 
or guidance presented, (3) the presence or absence of working examples, (4) the 
nature of the invention, (5) the state of the prior art, (6) the relative skill of those in 
the art, (7) the predictability or unpredictability of the art, and (8) the breadth of the 
claims." In re Wands, 858 F.2d 731, 737, 8 USPQ2d 1400, 1404 (Fed. Cir. 1988). 
Applicants address each of the Wands factors below. 

(1) The quantity of experimentation needed is quite low. As noted above, 
methods of producing nucleotide sequences are well-known in the art. A method of 
testing for brittle-1 activity cited in Applicants' Specification is known in the art and is 
summarized in the Written Description section of this brief. 

(2) Applicants' Specification provides sufficient direction for producing 
nucleotide sequences encoding proteins having 90% identity to SEQ ID NO: 18 and a 
specific assay for brittle-1 activity is provided. 

(3) Applicants admit that there are no working examples showing brittle-1 

activity in Applicants' Specification. A specification's lack of working examples, 

however, does not automatically equate to nonenablement of the claimed invention. 

A claim will not be invalidated on section 1 12 grounds simply because 
the embodiments of the specification do not contain examples explicitly 
covering the full scope of the claim language. That is because the 
patent specification is written for a person of skill in the art, and such a 
person comes to the patent with the knowledge of what has come 
before. 

LizardTech, Inc. v. Earth Resource Mapping, Inc., 424 F.3d 1336, 1345, 76 USPQ2d 
1724, 1733 (Fed. Cir. 2005) (internal citation omitted). 

(4) The invention is one of nucleotide sequences encoding proteins having 
brittle-1 activity. Such an invention requires some experimentation for even routine 
techniques. 

(5) Brittle 1 protein (BT1), the major protein in amyloplast envelope 
membranes of wild-type maize endosperm, is required for normal levels of starch 

14 
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accumulation in maize kernels. Loss of BT1 in the brittlel (bt1) mutant results in an 
80% reduction in kernel starch. The relation of BT1 to starch accumulation was 
investigated in two well-characterized maize endosperm suspension-cultured cell 
lines which were derived from 10 days post pollination (DPP) kernels of inbred A636 
and 12-DPP kernels of the waxy mutant in the A636 background. Starch in A636 
endosperm cultures accounted for about 1 .5% of the fresh weight of cells but BT1 
was not detectable in amyloplast membranes or in microsomal membranes isolated 
from the cultured cells. See Cao H. & Shannon, J.C., Physiol. Plantarum 97:665-73 
(1996). 

In vivo studies of carbohydrate metabolism during development of maize 
kernels showed that bt1 kernels accumulated more than 13 times as much 
adenosine 5-diphosphoglucose (ADP-GIc) as normal kernels. Activity of starch 
synthase in bt1 endosperm was equal to that in endosperm extracts from normal 
kernels. The ADP-GIc accumulation in bt1 endosperm cells was not due to a 
deficiency in starch synthase, and results indicated that AGPase is the predominant 
enzyme responsible for in vivo synthesis of ADP-GIc in bt1 mutant kernels. See 
Shannon J.C. et a/., Plant Physiol. 110:835-43 (1996). 

(6) This invention is related to the biotechnical arts in a well-known pathway, 
the transport of ADP-glucose from the cytosol to the plastid, and the skill level of the 
artisan is very high. See, e.g., Kubin, Appeal 2007-0819, at 14 (noting that the level 
of skill in the molecular biology art is high). The skilled artisan is therefore very 
familiar with the pathway and well versed in many methods and techniques of, for 
example, gene manipulation, protein synthesis, and enzyme action. 

(7) Claim 26 is directed to a nucleotide sequence encoding a protein having a 
specified activity. It is unreasonable for Applicants to provide a cookbook recipe of 
how to practice the invention. Rather, Applicants have depended on the skill and 
experience of the skilled artisan to implement the invention using nucleotide 
sequences encoding polypeptides having brittle-1 activity. Applicants expect that the 
skilled artisan would be aware of successful molecular biology and biochemistry 
methods and therefore be capable of producing the described sequences and testing 
these sequences for brittle-1 activity. 
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(8) The Examiner's concerns about the number of possible sequences having 
90% identity to SEQ ID NO:18 are unfounded. See, e.g., Second Final OA, at 4 
("Making all possible single amino acid substitutions in an [sic] 432 amino acid long 
protein . . . would require making and analyzing 19 432 (2.6 x 10 552 ) nucleic acids; 
these proteins would have 99.8% identity to SEQ ID NO:18."). Indeed, the number 
of possible claimed sequences should not itself form the basis of an enablement 
rejection. See, e.g., NovozymesA/S v. Genencor Int'l, Inc., 446 F. Supp. 2d 297, 
330 (D. Del. 2006) (noting that, with claims 95% identity to a disclosed sequence, a 
"large number [of possible sequences] alone is not sufficient to show a lack of 
enablement . . ."). 

Outside of factor (3), the Wands factors support Applicants assertion that any 
experimentation required to practice the present claims would be routine. "It is well 
established that a patent applicant is entitled to claim his invention generically when 
he describes it sufficiently to meet the [enablement requirement]." Amgen, 927 F.2d 
at 1213, 18 USPQ2d at 1027. In Amgen, the court found that a generic claim 
covering all possible DNA sequences encoding any polypeptide having an amino 
acid sequence "sufficiently duplicative" of erythropoietin ("EPO") and which causes 
bone marrow cells to increase production of reticulocytes and red blood cells, and 
increases hemoglobin synthesis or iron uptake as being nonenabled where the 
patentee only provided information "of how to make EPO and very few analogs." Id. 
at 1213-14, 1027. As noted in Novozymes, however, "[t]he problem in [Amgen] was 
that the claim scope covered any gene that could be used to express proteins of 
various sizes that had one or more of the biological properties of EPO." Novozymes, 
446 F. Supp. 2d at 330 (emphasis added). Unlike the patentee in Amgen, Applicants 
are not claiming all nucleotide sequences encoding enzymes having brittle-1 activity 
but merely those having 90% identity to SEQ ID NO:18. Even the Amgen court 
recognized that the enablement requirement should not be extended beyond 
reasonableness when it noted that the disclosure at issue there might have been 
sufficient to enable a claim for EPO analogs similar to those described in that 
specification. Amgen , 927 F.2d at 1213, 18 USPQ2d at 1027 (noting that the 
patentee's "disclosure might well justify a generic claim encompassing these and 
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similar analogs, but it represents inadequate support for [patentee's] desire to claim 
all EPO gene analogs"). 

Applicants' situation is similar to that in Novozymes. There, patentee's claim 
1 read: 

A variant of a parent Bacillus stearothermophilus alpha-amylase, 
wherein the variant has an amino acid sequence which has at least 
95% homology to the parent Bacillus stearothermophilus alpha- 
amylase and comprises a deletion of amino acids 179 an [sic] 180, 
using SEQ ID NO:3 for numbering, and wherein the variant has alpha- 
amylase activity. 

Novozymes, 446 F. Supp. 2d at 306. Novozymes concluded that "requiring at least 
95% homology with [the identified sequence] makes the variants sufficiently similar 
so that the enablement requirement is satisfied. By contrast to Amgen, the claim 
scope [in Novozymes] is limited quantitatively to similarity between protein 
sequences and not just to a requirement for alpha-amylase-like activity." 
Novozymes, 446 F. Supp. 2d at 300. Applicants' current claims are similarly 
structured. There is a quantitative limit to the similarity between SEQ ID NO: 18 and 
other proteins in the claimed genus and all proteins having 90% identity to SEQ ID 
NO: 18 must have brittle-1 activity. Thus, Applicants' claimed invention should be 
sufficiently enabled. 

Further evidencing enablement of claim 26 is that the novel aspect of the 
invention is enabled in Applicants' Specification. In a recent Federal Circuit case, 
the court clarified that "[although the knowledge of one skilled in the art is indeed 
relevant [to an enablement determination], the novel aspect of an invention must be 
enabled in the patent." Auto. Techs. Int'l, Inc. v. BMW of N. Am., Inc., 2007 U.S. 
App. LEXIS 21271, at *22 (Fed. Cir. Sept. 6, 2007). In the present application, the 
novel aspect of the invention is the sequence set forth in SEQ ID NO:18 and variants 
thereof. As SEQ ID NO: 18 was present in the sequence listing, which is considered 
part of the specification as filed, the novel aspect of the invention is enabled in the 
specification. Whether or not the claimed sequences have brittle-1 activity is 
irrelevant to the novelty of the claimed sequences; a claim directed solely to "a 
nucleotide sequence encoding an amino acid sequence having 90% identity to SEQ 
ID NO: 18" would be novel without the brittle-1 activity limitation, which is present for 
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section 112 purposes only. Indeed, brittle-1 activity itself is not novel; as Applicants' 

Specification notes, 

[b]rittle-1 is one of several corn genes that, when mutated, cause the 
accumulation of sugars, rather than starch, in developing corn seeds. 
It has been shown that the brittle-1 gene encodes a plastidic 
membrane transporter that is involved in the transport of ADP-glucose 
from the cytosol to the plastid where it is used for starch biosynthesis. 

Applicants' Specification, at page 1, lines 12-15 (citation omitted). Therefore, this 

knowledge can be imputed from those skilled in the art to supplement the present 

disclosure, as routine experimentation (a brittle-1 assay) provides the determination 

of whether a sequence having 90% identity to SEQ ID NO: 18 is within the scope of 

the claim 26 invention. See Invitrogen, 429 F.3d at 1070-71, 77 USPQ2d at 1173 

("The scope of enablement ... is that which is disclosed in the specification plus the 

scope of what would be known to one of ordinary skill in the art without undue 

experimentation." (quoting Natl Recovery Techs., Inc. v. Magnetic Separation Sys., 

Inc., 166 F.3d 1190, 1194, 49 USPQ2d 1671, 1674 (Fed. Cir. 1999))). 

Kubin also supports Applicants' enablement arguments. As noted above, the 

Kubin Appellants claimed polynucleotides encoding polypeptides having 80% identity 

to a defined amino acid sequence, which has a defined binding activity. Kubin, 

Appeal 2007-0819, at 3. In Appellants' specification, the Board found that 

Appellants' specification taught how to make variants of the defined amino acid 

sequence, how to calculate identity between the defined amino acid sequence and 

the variants, and how to test the variant for the claimed binding activity. Id. at 13. 

The specification did not disclose, however, which amino acids could be changed 

and still retain the claimed activity, and it did not disclose any actual variants of the 

defined amino acid sequence. Id. The examiner in Kubin rejected the claims as 

lacking enablement for sequences having identity to the defined amino acid 

sequence because of the absence of working examples and because changes in 

defined amino acid sequence might alter the function of the variant as compared to 

the defined amino acid sequence. Id. at 10. The examiner there also noted the 

unpredictability of the molecular biology art. Id. at 13. In finding enablement of the 

claimed invention, the Board agreed with the examiner that the molecular biology art 

was unpredictable (Wands factor 7), but "the other Wands factors weigh[ed] in 
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Appellants' favor, particularly the state of the art and the relative skill of those in the 
art as evidenced by the prior art teachings and Appellants' Specification." Id. at 14 
(internal citations and markings omitted). Further, the Board noted that "[t]he amount 
of experimentation to practice the full scope of the claimed invention might have 
been extensive, but it would have been routine . The techniques necessary to do so 
were well known to those skilled in the art." Id. (emphasis added). Like the Kubin 
Appellants, Applicants here provided teachings on how to make variants of SEQ ID 
NO: 18 (see, e.g., Applicants' Specification at page 15, line 4 - page 16, line 22), 
described how to calculate the sequence identities between SEQ ID NO:18 and its 
variants (see, e.g., Applicants' Specification at page 8, line 34 - page 9, line 17), and 
provided the Shannon assay to test for brittle-1 activity. Thus, Kubin dictates that 
Applicants' claims are enabled. 

Applicants further note that, if Applicants' claimed invention is limited to only 
those nucleotide sequences encoding SEQ ID NO: 18 as suggested by the Examiner, 
Applicants' patent rights become essentially useless because the skilled artisan 
could simply modify one amino acid of SEQ ID NO: 18 (the sequence of which is 
undisputedly disclosed in Applicants' Specification), confirm brittle-1 activity by the 
Shannon assay (undisputedly referenced in Applicants' Specification), yet be outside 
the scope of the Applicants' claims even though Applicants' Specification disclosed 
the complete roadmap to working around the exceptionally narrow claims. In 
essence, the Examiner's scope of enablement rejection produces the absurd result 
of Applicants' Specification enabling the skilled artisan to avoid infringement of 
claims covering only nucleotide sequences encoding SEQ ID NO: 18, but the same 
specification failing to enable the same skilled artisan to produce the same modified 
amino acid sequence if the claims cover sequences having 90% identity to SEQ ID 
NO:18. 

Applicants also believe that any of the arguments presented in the 
enablement section should be applicable towards establishing that sufficient written 
description was present in Applicants' Specification as filed and vice versa. As noted 
in LizardTech, "a recitation of how to make and use the invention across the full 
breadth of the claim is ordinarily sufficient to demonstrate that the inventor 
possesses the full scope of the invention, and vice versa." 434 F.3d at 1345, 76 
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USPQ2d at 1733. That the present specification supports possession (written 
description) of the genus of polypeptides encompassed by the present claims (see 
above) further evidences enablement of the present claims. All methods for 
generating the described polypeptide variants were standard in the art at the time of 
filing. Likewise, methods for testing for the required activity were described in 
Applicants' Specification (see above). Thus, the possessed genus is enabled, 
almost by definition. 

In view of the foregoing, Applicants respectfully request withdrawal of the 
Section 112, 1 st paragraph, enablement rejections. 

VIII. CONCLUSION 

For the reasons set forth above, the Board is respectfully requested to reverse 
the final rejection of pending Claims 26-29 and indicate allowability of all claims. 

Please charge any fee due which is not accounted for to Deposit Account No. 
501447 (Potter Anderson & Corroon LLP). 




Jeffrey^lSafran V 
Attorney for Applicants 
Reg. No.: 54,689 
Telephone: (302) 984-6132 
Facsimile: (302) 658-1192 



Dated: \ 0eJt&Lf^30 Tdd y 
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CLAIMS APPENDIX 

26. An isolated polynucleotide comprising: 

(a) a nucleotide sequence encoding a polypeptide having brittle-1 activity, 
wherein the polypeptide has an amino acid sequence of at least 90% 
sequence identity when compared to SEQ ID NO: 18, or 

(b) a full-length complement of the nucleotide sequence of (a). 

27. The isolated polynucleotide of Claim 26, wherein the polypeptide has a 
sequence identity of at least 95%, based on the Clustal method of alignment, when 
compared to SEQ ID NO: 18. 

28. A recombinant DNA construct comprising the polynucleotide of Claim 26 
operably linked to a regulatory sequence. 

29. A vector comprising the polynucleotide of Claim 26. 
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EVIDENCE APPENDIX 

A terminal disclaimer, attached herewith, was filed on December 21 , 2005, 
disclaiming the terminal portion of any patent on this application that would extend 
beyond the expiration date of 6,660,850. The terminal declaimer was reviewed, 
accepted, and recorded. 

Also attached herewith are the following references: Shannon era/., Plant 
Physiol. 117:1235-52 (1998); You L. & Arnold F.H., Protein Eng. 9:77-83 (1996); 
Crameri A. etal., Nat. Biotech. 15:436-38 (1997); Wu T.D. & Brutlag D.L., Proc. Int. 
Conf. Intell. Syst. Mol. Biol. 4:230-40 (1996); Cao H. & Shannon, J.C., Physiol. 
Plantarum 97:665-73 (1996); Shannon J.C. etal., Plant Physiol. 110:835-43 (1996). 
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RELATED PROCEEDINGS APPENDIX 

None 
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Brittle-l, an Adenylate Translocator, Facilitates Transfer of 
Extraplastidial Synthesized ADP-Glucose into Amyloplasts of 

Maize Endosperms 1 

Jack C. Shannon*, Fang-Mei Pien, Heping Cao 2 , and Kang-Chien Liu 

Department of Horticulture, 102 Tyson Building, The Pennsylvania State University, 
University Park, Pennsylvania 16802 



Amyloplasts of starchy tissues such as those of maize (Zea mays 
L.) function in the synthesis and accumulation of starch during 
kernel development. ADP-glucose pyrophosphorylase (ACPase) is 
known to be located in chloroplasts, and for many years it was 
generally accepted that AGPase was also localized in amyloplasts of 
starchy tissues. Recent aqueous fractionation of young maize en- 
dosperm led to the conclusion that 95% of the cellular AGPase was 
extraplastidial, but immunolocalization studies at the electron- and 
light-microscopic levels supported the conclusion that maize en- 
dosperm AGPase was localized in the amyloplasts. We report the 
results of two nonaqueous procedures that provide evidence that in 
maize endosperms in the linear phase of starch accumulation, 90% 
or more of the cellular AGPase is extraplastidial. We also provide 
evidence that the brittle-l protein (BT1), an adenylate translocator 
with a KTGGL motif common to the ADP-glucose-binding site of 
starch synthases and bacterial glycogen synthases, functions in the 
transfer of ADP-glucose into the amyloplast stroma. The importance 
of the BT1 translocator in starch accumulation in maize endosperms 
is demonstrated by the severely reduced starch content in btl 
mutant kernels. 



Starch is synthesized and accumulates in the amyloplasts 
of storage tissues (Shannon and Garwood, 1984; Boyer et 
al., 1989; Smith et al., 1997). The enzymatic reactions cata- 
lyzed by AGPases (EC 2.7.7.27), starch synthases (EC 
2.4.1.21) (Preiss, 1991), SBEs (EC 2.4.1.18) (Cao and Preiss, 
1996; Fisher et al., 1996), and starch-debranching enzymes 
Qames et al., 1995; Rahman et al., 1998) are much better 
understood than the mechanism involved in the transport 
of substrates across the amyloplast envelope membranes 
and the compartmentation of AGPase (Pozu eta-Romero et 
al., 1991; Liu et al., 1992; Okita, 1992; Hannah et al., 1993; 
Villand and Kleczkowski, 1994; Denyer et al., 1996; Pien 
and Shannon, 1996; Shannon et al., 1996; Thorbjornsen et 
al., 1996; Mohlmann et al., 1997). One of the major factors 
hindering progress is the difficulty of isolating highly pu- 
rified intact amyloplasts and amyloplast membranes from 
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storage organs because of the presence of a dense starch 
granule(s) within the fragile envelope membrane (Liu and 
Shannon, 1981; Echeverria et al., 1985; Gardner et al., 1987; 
Shannon et al., 1987; Shannon 1989). 

We recently developed a rapid yet gentle procedure for 
the isolation of intact amyloplasts and their envelope mem- 
branes from immature maize (Zea mays L.) endosperms 
(Cao et al., 1995) and from maize endosperm suspension 
cultures (Cao and Shannon, 1996). Immunological charac- 
terization indicated that Btl encodes the major 39- to 44-kD 
polypeptides of the purified amyloplast membranes, BT1. 
Results from several studies support the possibility that 
BT1 plays a significant role in starch accumulation in maize 
endosperm. For example, BT1 is specifically deficient in the 
amyloplast envelope membranes isolated from btl, a 
starch-deficient endosperm mutant (Cao et al., 1995). 

Shannon et al. (1996) demonstrated that ADP-Glc, the 
direct substrate for starch synthesis, accumulated in btl 
mutant endosperms and that AGPase is the predominant 
enzyme responsible for the synthesis of ADP-Glc in bt 1 . In 
a preliminary report we showed that amyloplasts from 
young kernels isolated from btl endosperms were only 
25% as active in ADP-Glc uptake and conversion to starch 
as amyloplasts from normal and mutant maize endosperms 
(Liu et al., 1992). The amino acid sequence deduced from 
Btl cDNA (Sullivan et al., 1991) shows high homology with 
mitochondrial adenylate translocators from some species, 
and in vitro-synthesized BT1 is targeted to the inner chlo- 
roplast membrane (Li et al., 1992). 

Giroux and Hannah (1994) reported that the BT2 and 
SH2 subunits of AGPase from maize endosperms are 
the same size as the recombinant subunits, and suggested 
that AGPase in maize endosperm may not be located in 
amyloplasts. Denyer et al. (1996) recently provided evi- 
dence that maize endosperm cells contain two isozymes of 
AGPase, with more than 95% of the total activity being 
extra-amyloplastic. All of these data support the sugges- 



Abbreviations: ADH, alcohol dehydrogenase; AGPase, ADP-Glc 
pyrophosphorylase; APase, alkaline pyrophosphatase; BT1 and 
BT2, brittle-l and brittle-2 proteins, respectively; DPP, days 
postpollination; FSB A, S'-p-fluorosulfonylbenzoyl adenosine; 
hexose-P, hexose phosphate; 3-PGA, 3-phosphoglyceraldehyde; 
SBE, soluble starch-branching enzyme; SH2, shrunken-2 protein; 
SS, soluble starch synthase; SUS, Sue synthase; TCE, tetrachloro- 
ethylene; UGPase, UDP-Glc pyrophosphorylase. 
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tion that most of the ADP-Glc required for starch synthesis 
in maize endosperm is synthesized by cytosolic AGPase 
(Denyer et al, 1996), and that BT1 is the adenylate trans- 
locator responsible for the transfer of ADP-Glc into maize 
endosperm amyloplasts (Cao et al., 1995; Sullivan and 
Kaneko, 1995). Homologs of BT1 may be present in the 
amyloplast membranes from other starchy tissues, but they 
are not recognized by the antibodies to BT1 used by Cao et 
al. (1995) and by Cao and Shannon (1996, 1997). 

Cytosolic localization of AGPase in maize endosperm 
cells is not supported by recent immunolocalization stud- 
ies. For example, in a study using a transmission electron 
microscope, Miller and Chourey (1995) reported that pro- 
teins recognized by antibodies to spinach leaf AGPase were 
confined to amyloplasts, whereas antibodies to the peptide 
subunits of maize endosperm AGPase, BT2 and SH2, most 
heavily immunolabeled the amyloplasts and cell walls, 
with lighter labeling of the cytosol. In an in situ immuno- 
localization study at the light-microscopic level, Brangeon 
et al. (1997) observed that BT2 and SH2 antibodies (the 
same source of antibodies used by Miller and Chourey 
[1995]) immunolabeled both the amyloplasts and sur- 
rounding cytosol in pericarp cells from very young kernels, 
but immunolabel in endosperm cells from older kernels 
was closely associated with the amyloplasts only. These 
authors concluded that AGPase was localized in the amy- 
loplast stroma of endosperm cells. However, at this level of 
resolution it is not possible to determine conclusively 
whether the immunolabeled proteins are in the plastid 
stroma or outside the envelope, and they correctly noted 
that the AGPase could have been bound to the outer mem- 
brane of the plastid envelope (Brangeon et al., 1997), and 
thus would partition as a "cytosolic" enzyme during aque- 
ous fractionation. 

A potential drawback of immunocy to localization studies 
of cereal endosperm tissues at the electron-microscopic 
level is the difficulty encountered in sufficiently embed- 
ding the tissues so that the thin slices of starch granules do 
not "pop" out of the plastic before viewing. As a conse- 
quence, only amyloplasts with very small starch granules 
in cells located in the physiologically less-developed parts 
of the endosperm survive preparation for electron- 
microscopic examination. The surviving sections may or 
may not be representative of the entire tissue. Although 
this difficulty is minimized by using thicker sections for 
immunolocalization at the light-microscopic level, the res- 
olution is not adequate to distinguish protein localization 
inside or outside of the plastid membranes. 

There are also drawbacks to studies of enzyme compart- 
mentation based on aqueously isolated amyloplasts. For 
example, during aqueous isolation most of the amyloplasts 
with starch granules larger than 1 or 2 /xm in diameter are 
ruptured and the resulting preparation is enriched with 
amyloplasts containing smaller starch granules. To obtain 
the highest yield of intact amyloplasts, endosperms from 
very young kernels just beginning starch accumulation are 
used (Shannon et al., 1987). Activities of AGPase and SS 
were very low or undetectable in endosperms from 12-DPP 
kernels (Tsai et al., 1970), and Brangeon et al. (1997) 
showed a gradient of expression of the genes encoding 



AGPase from the periphery of the endosperm toward the 
center, with central endosperm cells of kernels 15 DPP 
most intensely immunolabeled by antibodies to BT2 and 
SH2. As a consequence, enzyme compartmentation in amy- 
loplasts from very young kernels or from the physiologi- 
cally younger cells near the periphery of the endosperm 
may not be representative of compartmentation in amylo- 
plasts from those cells most actively engaged in starch 
biosynthesis. 

To overcome these difficulties we developed two non- 
aqueous fractionation procedures to determine compart- 
mentation of enzymes in amyloplasts from maize kernels 
in the linear phase of starch accumulation (about 20 DPP). 
Results of these studies are compared with results of an 
aqueous subcellular fractionation/ immunoblotting study. 
Finally, results of a study of the uptake and incorporation 
into starch of metabolites by intact amyloplasts isolated 
from normal and mutant endosperms are reported. These 
studies support the conclusion that maize endosperm cells 
contain an extraplastidial form of AGPase, and that the 
amyloplast membrane-specific polypeptide, BT1, is an 
adenylate translocator. 



MATERIALS AND METHODS 

Plant materials were either grown in the field at the 
Russell E. Larson Agricultural Research Farm (Centre 
County, PA) or grown in 20-L plastic pots containing two 
parts peat, two parts perlite, and one part soil. Potted 
plants were grown in the greenhouse in late winter and 
spring or were started in the greenhouse in the spring and 
then transferred outside the greenhouse for continued 
growth. High-intensity sodium lamps were used in the 
greenhouse to extend the day length to 16 h. Unless noted 
otherwise, the normal maize (Zea mays L.) inbred W64A 
and the endosperm mutant genotypes waxy (wx), brittle-1 
(btl), and shrunken-2 (sh2) in a near-isogenic W64A back- 
ground were used in these studies. 



Nonaqueous Compartmentation Studies 

Estimating Amyloplast Compartmentation of Enzymes after 
Fractionation in Mixtures of TCE and Heptane 

Endosperms from 20-DPP W64A inbred kernels were 
removed, frozen in liquid nitrogen, and freeze-dried. Pul- 
verized samples were sifted through a 20- jim sieve using a 
sonic sifter fitted with a horizontal pulse generator (ATM 
Corp., Milwaukee, WI). The TCE /heptane procedure was 
patterned after methods used by Riens et al. (1991) and 
MacDougall et al. (1995). A total of 400 mg of dry, sifted 
endosperm in 50-mg batches was homogenized in 15-mL 
polypropylene centrifuge tubes in 2 mL of dry TCE using 
an ultrasonic probe (Biosonic IIA, Bronwill Scientific, Ro- 
chester, NY) for a total of 2.5 min using 30-s bursts. During 
homogenization the tube was held in a 95% ethanol/dry 
ice bath to reduce heating of the sample. Molecular sieve 
beads (4 A, Sigma) were added to all TCE and heptane 
solutions before use to remove all traces of water, and 
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care was taken to keep all tubes tightly closed whenever 
possible. 

The TCE homogenates were combined and n-heptane 
added to give a TCEiheptane mixture of 85:15 (v/v). Ali- 
quots of this mixture were removed for "total- 
homogenate" analyses. The balance of the mixture was 
dispensed into several microcentrifuge tubes and the cel- 
lular contents were fractionated into amyloplast- and 
cytosol-enriched fractions by differentially pelleting the 
starch granules and associated enzymes from TCE /hep- 
tane mixtures of varying density. For example, the most 
dense fraction was pelleted from the 85:15 (v/v) TCE: 
heptane suspension by centrifugation in the cold (4°C) for 
5 min at 16,000£. The pellet (pellet A) was retained and the 
supernatant was diluted with heptane to a final TCE:hep- 
tane ratio of 83:17 (v/v). Centrifugation was then repeated 
to yield pellet B. The resulting supernatant was again di- 
luted with heptane to a ratio of 75:25 (v/v) and centrifuged 
as before to yield pellet C. 

The 75:25 (v/v) TCE:heptane supernatant yielded the 
cytosol-enriched fraction. Aliquots of the initial unfraction- 
ated homogenate and the final cytosol-enriched superna- 
tant were diluted with 3 volumes of heptane, and the 
particulate material in these heptane-diluted samples was 
collected by centrifugation in the cold for 10 min at 3000g. 
The clear supernatants were discarded and all pellets were 
held overnight at 4°C in a vacuum desiccator containing 
paraffin oil and silica-gel desiccant to remove the residual 
TCE and heptane. The dried pellets were extracted for 
enzyme analysis and the number of starch granules was 
determined. The TCE/heptane fractionation was repeated 
three times. 

Amyloplasts in W64A endosperms each contain a single 
starch granule (Liu and Shannon, 1981); therefore, starch 
granule number was used as a measure of amyloplast 
number in the unfractionated homogenates and in the 
TCE/heptane fractions. We determined that the enzyme 
activities per million starch granules in the two most dense 
TCE/heptane fractions (pellets A and B) were very similar, 
and thus the means of both fractions ± se (six values) were 
plotted. Likewise, activities per million starch granules in 
the unfractionated homogenate and in aqueous extracts of 
the sifted endosperm samples were very similar, and the 
means ± se (six values) of these were plotted. The data 
from pellet C and supernatant fractions are the means ± se 
of the three fractionations. To estimate compartmentation 
of an enzyme in amyloplasts, the average enzyme activity 
per million starch granules (y axis) from the four fractions 
(pellets A and B, unfractionated homogenate/ aqueous ex- 
tract, pellet C, and the supernatant) was plotted against the 
activity of a cytosol or vacuole marker enzyme per million 
starch granules from the same fractions (x axis). The y 
intersect of a regression line gives an estimate of enzyme 
activity per million amyloplasts in the absence of cytosol or 
vacuole contamination. 

Glycerol Isolation of Amyloplasts 

A procedure for the nonaqueous isolation of starch gran- 
ules with associated metabolites (amyloplasts) from maize 



endosperm amyloplasts was reported previously (Liu and 
Shannon, 1981). In that procedure the dry endosperm sam- 
ple was homogenized in dry glycerol and filtered through 
Miracloth (Calbiochem), and the starch granules were pel- 
leted through a more dense solution of 3-Cl-l,2- 
propanediol. Although this procedure yielded an amylo- 
plast fraction essentially free of nuclear and cytosolic 
contaminants, the starch biosynthetic enzymes were inac- 
tivated. We determined that inactivation was caused pri- 
marily by excessive heating of the sample during homog- 
enization in glycerol and by exposure to 3-Cl-l,2- 
propanediol. The glycerol nonaqueous isolation procedure 
was therefore modified for the enzyme-compartmentation 
study. Fifty milligrams of sifted endosperm as used above 
was added to a microcentrifuge tube containing 1 mL of 
dry, cold (4°C) glycerol. The sample was thoroughly dis- 
persed using a disposable plastic microtube pestle. The 
microcentrifuge tube was closed and placed on ice, and 
centrifugation was carried out at 4°C for 20 min at 25,000g. 
The supernatant was transferred to a 15-mL centrifuge tube 
and the pellet was washed with 0.5 mL of dry, cold glycerol 
by suspension using the microtube pestle and centrifuga- 
tion as above. After the wash supernatant was added to the 
initial supernatant, the wall of the microcentrifuge tube 
was wiped with a tissue to remove excess glycerol. The 
combined supernatants and pellet were retained for en- 
zyme analyses. Total enzyme activity in aqueous extracts of 
the sifted endosperm was also determined. 

Enzyme Extraction and Assay 

Duplicate samples of the TCE/heptane homogenate and 
of the four fractions were retained for enzyme analysis. 
Each fraction retained for extraction and enzyme assay was 
derived from approximately 42 mg of sifted endosperm. 
For the controls, duplicate subsamples (50 mg) of the sifted 
endosperm were also extracted and enzyme activities de- 
termined. Soluble enzymes were extracted from all pellet 
fractions and the sifted endosperm samples (TCE/heptane 
and glycerol) with 2 mL of HSB extraction buffer (50 mM 
Hepes, pH 7.5, 0.5 m sorbitol, 10 mM KC1, 1 mM 
MgCl 2 -6H 2 0, 1 mM EDTA, 5 mM dithioerythritol, and 0.1% 
BSA) by sonication for four 10-s bursts with 10-s rest peri- 
ods between each burst using the Biosonic IIA ultrasonic 
probe set at 60% maximum power. The tubes were held in 
an ice bath during sonication. 

The glycerol supernatants (approximately 1.5 mL) were 
diluted to 5 mL with the HSB extraction buffer. The ho- 
mogenates and diluted glycerol supernatants were centri- 
fuged in the cold (4°C) for 10 min at 3000# and the super- 
natants retained for enzyme assay. Extracts from one set of 
the TCE/heptane pellets were used for assay of AGPase, 
UGPase, and ADH. AGPase and UGPase were assayed by 
the coupled-spectrometric method as described by Oh-Lee 
and Setter (1985), except that the AGPase and UGPase 
assays were started by the addition of 0.4 mM ADP-GIc and 
0.4 mM UDP-Glc, respectively, and ADH as described by 
Cao et al. (1995). A small number of fractions were ex- 
tracted at a time and AGPase, the most labile enzyme of the 
three, was assayed first. Extracts from the second set of 
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TCE /heptane pellets were used for assay of a-mannosidase 
as described by Boiler and Kende (1979), for assay of APase 
as described by Gross and ap Rees (1986), and for assay of 
SBE and SS as described by Shannon et al. (1996). SUS was 
assayed in the hydrolytic direction as described by Echev- 
erria et al. (1988), except that after heat inactivation, the 
quantity of Fru released was determined by a reducing 
sugar test as described previously (Shannon et al., 1996). 
The data were corrected for any Fru released in the absence 
of added UDP. HSB extracts of the glycerol fractions were 
used for assay of ADH, UGPase, AGPase, SBE, SS, and 
APase as described above. 

Starch Granule Number 

The number of starch granules remaining in the pellets 
after HSB extraction for enzyme assays was determined as 
described by Shannon et al. (1996). 

Aqueous Compartmentation Study 

Purification of Amyloplasts and Separation of Amyloplast 
Membranes and Stroma 

Crude and Percoll-purihed amyloplasts were isolated 
from the endosperm of freshly harvested developing ker- 
nels (13-16 DPP) as described by Cao et al. (1995). The 
purified amyloplast pellet was suspended in a small vol- 
ume of TDEP buffer (10 dim Tricine, pH 7.2, 1 mM DTT, 1 
mM EDTA, and 0.5 mM PMSF), and the amyloplasts were 
lysed by one cycle of freezing at -70°C and thawing at 
30°C. After removal of starch granules by centrifugation at 
800g, amyloplast stroma was separated from amyloplast 
membranes by centrifugation at 100,000g for 60 min. Amy- 
loplast membranes were further purified from the crude 
membrane pellet through a discontinuous Suc-density gra- 
dient as described by Cao et al. (1995). The purified amy- 
loplast membrane pellet was suspended in TDEP buffer 
plus 0.2 M Sue and stored at -70°C. 

Isolation of Microsomal Membranes 

Freshly isolated endosperms and other tissues were ho- 
mogenized in a buffer containing 0.4 m Sue, 50 mM Mops, 
pH 6.9, 10 mM DTT, 1 mM EDTA, 0.1 mM PMSF, and 0.1% 
(w/v) BSA, and the homogenate was fractionated by dif- 
ferential centrifugation at 2,000g (P2), 10,000g (P10), and 
100,000g (P100) according to the method of Cao et al. 
(1995). The P2 and P10 pellets and the microsomal mem- 
brane pellet (P100) were suspended in TES buffer (10 mM 
Tricine, pH 7.2, 1 mM EDTA, and 0.2 m Sue) and stored at 
-70°C. 

Marker-Enzyme Analysis 

SBE (an amyloplast marker) and ADH (a cytosol marker) 
were assayed as described above. Catalase (EC 1.11.1.6) (a 
marker for microbodies), Cyt c oxidase (EC 1.9.3.1) (a 
marker for mitochondria), cyanide-insensitive NADH-Cyt 
c reductase (EC 1.6.99.3) (a marker for the ER), and 



vanadate-sensitive ATPase (EC 3.6.1.4) (a marker for 
plasma membrane) were assayed as described by Cao et al 
(1995). Potassium-stimulated ATPase (a marker for plasma 
membrane), Triton-stimulated UDPase (a marker for the 
Golgi), and nitrate-sensitive ATPase (a marker for the tono- 
plast) were assayed as described by Briskin et al. (1987). 
Protein contents were measured using the Bradford 
method plus NaOH treatment (Cao et al., 1995). 



SDS-PACE and Immunoblotting 

Proteins were solubilized and denatured in IX SDS gel- 
loading buffer by heating the samples in a boiling-water 
bath for 5 min. Polypeptides were separated by SDS-PAGE 
and stained with Coomassie brilliant blue R-250 (Cao et al., 
1995). Standard procedures were used for immunoblotting, 
as described previously (Cao et al., 1995). The polyclonal 
antibodies to maize SH2 and BT2 were gifts from Michael 
Giroux and L. Curtis Hannah, University of Florida (Gir- 
oux and Hannah, 1994), and the polyclonal antibodies to 
maize BT1 were a gift from Thomas D. Sullivan, University 
of Wisconsin-Madison (Sullivan and Kaneko, 1995). The 
relative quantities of BT1, SH2, and BT2 were determined 
by scanning densitometry (model 300B, Molecular Dynam- 
ics, Sunnyvale, CA) using a method similar to that de- 
scribed by Cao et al. (1995). 

Calculation of the Cellular Localization of the BT2- and 
SH2- Antibody-Reacting Polypeptides of AGPase 

The following data and calculations were used to deter- 
mine the percentages of the SH2 and BT2 polypeptides of 
AGPase in the cytosol and amyloplasts. Total protein in the 
homogenate and crude amyloplast fraction (see Table III) 
was 3480 and 290 mg g" 1 fresh weight, respectively. 
Thirty-one percent of the amyloplast marker enzyme (SBE) 
was recovered in the crude amyloplast fraction (amyloplast 
yield). The crude amyloplast fraction was contarninated 
with 0.7% cytosol (percentage of the cytosol marker en- 
zyme ADH). Based on equal loading of proteins and as 
determined by immunoblotting analysis and densitometer 
scanning (see Fig. 4), we estimated that the crude amylo- 
plast fraction contained 39% and 18% of the cellular SH2 
and BT2 polypeptides, respectively. 

To determine the percentage of SH2 compartmented in 
the cytosol, we set the homogenate (the crude amyloplast 
fraction [A] and the cytosol fraction [C]) containing 100% 
SH2 X 3480 mg of protein = 3480 SH2 units; and the SH2 
in the crude amyloplast {A + 0.7% of C) fraction = 39% 
SH2 polypeptide X 290 mg of protein/31% (amyloplast 
yield) = 365 SH2 units. To solve for C: (A + C) - (A + 
0.007C) = 3480 - 365; 0.993C = 3115; C = 3137 SH2 units 
and A = 3480 - 3137 = 343 SH2 units. Therefore, the 
percentage of SH2 in the cytosol = 3137/3480 x 100 = 
90.1%. Based on a similar calculation we determined that 
the cytosol contained 95.8% of the cellular BT2 polypep- 
tide, for an average of 93% of the cellular BT2- and SH2- 
antibody-reacting polypeptides of AGPase localized in the 
cytosol. 
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Metabolite- Uptake Studies 

Radioactive Metabolites and Chemicals 

Radioactive Glc-l-P, Glc-6-P, and ADP-Glc, uniformly 
14 C labeled in the carbohydrate moiety, were purchased 
from ICN. Substrates, cofactors, inhibitors, and enzymes 
were obtained from Sigma, and all other chemicals used 
were analytical reagent grade. 

Aqueous Amyloplast Isolation and Purification 

Endosperms were removed from kernels 10 to 16 DPP 
(the precise ages are given in the tables) and homogenized 
in approximately 1 volume (w/v) of homogenization 
buffer (50 mM Hepes, pH 7.5, 0.5 m sorbitol, 10 mM KC1 , 1 
mM MgCl 2 , 1 mM EDTA, 0.1% BSA, and 5 mM dithioeryth- 
ritol) for 2 s at top speed in a homogenizer (VirTis 23, The 
VirTis Co., Gardiner, NY). The homogenate was gently 
filtered through Miracloth and an aliquot layered on a 
gradient of 10%, 20%, and 40% Percoll in the homogeniza- 
tion medium. The gradient was centrifuged for 5 min at 
200g and the amyloplasts settling in the 20% Percoll layer 
were removed and used for the uptake studies. An aliquot 
of each preparation was removed to determine amyloplast 
intactness by measuring SBE activity before and after lysis, 
as described previously (Shannon et al., 1987). 

Metabolite Uptake and Incorporation 

For uptake and incorporation of ADP-Glc, amyloplasts 
(60-80 /xL) were added to a reaction mixture (200 /iL final 
volume) containing 100 mM Bicine, pH 8.5, 0.5 m sorbitol, 
12.5 mM EDTA, 10 mM GSH, 50 mM KC^Oj, and 4 mM 
[ 14 C]ADP-Glc (the specific activity varied from 64 to 300 
cpm/nmol). To determine the effect of ATP, ADP, or AMP 
on [ 14 C] ADP-Glc uptake and incorporation, the individual 
nucleotides (7 mM) were added to the amyloplasts in the 
20% Percoll isolation buffer and incubated on ice for 30 min 
before an aliquot of the amyloplast suspension was added 
to the uptake medium (final nucleotide concentration in the 
uptake medium was 2.2 mM). To determine whether a 
translocator with an adenosine-binding site functions in 
the uptake of [ 14 C]ADP-Glc, the amyloplasts were prein- 
cubated for 30 min at 30°C in the uptake mixture contain- 
ing varying concentrations of FSBA before the addition of 
( 14 C]ADP-Glc. In the FSBA study each uptake solution 
contained 2% DMSO, the solvent for FSBA. 

To determine the uptake and incorporation of Glc-l-P and 
Glc-6-P, amyloplasts (60-80 fxL) were added to a reaction 
mixture (200 p,L final volume) containing 15 mM Hepes, pH 
7.5, 0.5 m sorbitol, 10 mM MgCl 2 , 12.4 or 0.5 mM 3-PGA, 
0.08% BSA, 0.1 unit of inorganic pyrophosphatase, and 2 mM 
[ ,4 C]Glc-l-P (about 200 cpm/nmol) or [ 14 C]Glc-6-P (about 
200 cpm/nmol). ATP at 2 mM and rabbit liver glycogen at 1 
mg per uptake reaction were added as indicated. 

All uptake studies were completed with intact amylo- 
plast preparations and with lysed amyloplast preparations. 
There were no differences in the results when the amylo- 
plasts were lysed either by including 1% Triton X-100 in the 



uptake medium or by brief sonication (four times for 10 s 
each, with cool-down periods between) of the uptake me- 
dium containing amyloplasts before the addition of the 
14 C-metabolite. Unless noted otherwise the uptake reac- 
tions were carried out at 30°C and were terminated after 
120 min by addition of 2 mL of 75% methanol containing 
1% KC1. The alcohol-insoluble pellet was collected by cen- 
trifugation (2000g) in the cold for 10 min, and was washed 
twice with the methanol/KCl solution by suspension and 
centrifugation as above. The alcohol-washed pellets were 
then extracted three times with water by suspension and 
centrifugation as described above. The quantities of 14 C 
product in the water-soluble and -insoluble fractions were 
determined using a liquid-scintillation analyzer (Tri-Carb 
1500, Packard Instrument Co., Downers Grove, IL). 

An aliquot from each amyloplast isolation used for up- 
take studies was retained to determine the number of 
starch granules in each uptake reaction. Starch granule 
number was as determined previously (Shannon et al., 
1996). Uptake and incorporation data are presented as the 
amount per million starch granules (amyloplasts). It is 
assumed that each amyloplast settling in the 20% Percoll 
layer contains one starch granule. 

Protein-Sequence Analysis 

The protein sequences used in the analysis were obtained 
from the literature and searched from the database of the 
National Center for Biotechnology Information. The loca- 
tions of amino acid residues indicated in the tables corre- 
spond to the translated full-length sequences instead of the 
"mature" sequences. The sequence analysis was conducted 
as described previously for the alignment of branching 
enzymes (Cao and Preiss, 1996). 

Transmission Electron Microscopy 

Kernels were removed from 20-DPP btl ears and small 
portions of endosperm were removed from the middle of 
the kernel and fixed for 4 h at room temperature in 4% 
glutaraldehyde in 100 mM cacodylate buffer, pH 7.0. The 
samples were postfixed for 1 h in 1% osmium tetroxide in 
the same buffer and then dehydrated through an ethanol 
series and embedded in the ultra-low-viscosity medium 
(VCD/HXSA) described by Oliveira et al. (1983). Silver to 
gold sections were cut using a diamond knife and a mic- 
rotome (model III-8800, LKB, Bromma, Sweden) and exam- 
ined by a transmission electron microscope (model 
1200EXII, Jeol) either without additional staining or after 
staining with uranyl acetate and lead citrate. 

RESULTS 
Nonaqueous Fractionations 

TCE/Heptane Fractionation 

The percentage of recovery of starch granules and the 
activity of selected enzymes in the three fractions pelleting 
at various densities and the supernatant were all in excess 
of 80% of that in the original homogenate (Fig. 1). Forty- 
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Figure 1 . Distribution of starch granules and the activity of SUS, ADH, UGPase, AGPase, a-mannosidase (a-Mann), SBE, SS 
(St Syn), and APase in TCE/heptane fractions of different densities. The most dense fraction, pellet A (PA), was enriched in 
amyloplasts, and the least dense fraction, the supernatant (SUP) fraction, was enriched in cytosol. The distribution as a 
percentage of the sum of starch granules and enzyme activities from normal W64A endosperms and recovery of each as a 
percentage of the homogenate are recorded next to the figure key. Data are means ± se of three fractionations. PB, Pellet 
B; PC, pellet C. 



five percent of the starch granules was recovered in the 
most dense fraction (pellet A) and about 5% was recovered 
in the least dense fraction (the supernatant fraction). The 
amyloplast marker enzymes SBE and SS partitioned most 
closely with the starch granules, and the cytosol and 
vacuole marker enzymes, SUS, ADH, UGPase, and 
a-mannosidase, were low in the pellet A fraction and high 
in the supernatant fraction (Fig. 1). AGPase partitioned 
most closely with the cytosol marker enzymes and APase 
partitioned intermediate between the amyloplast and cy- 
tosol marker enzymes (Fig. 1). 

The quantities of SBE, SS, APase, and AGPase associated 
with amyloplasts were determined from the y intercepts of 
the simple regression of the plot of target-enzyme activity 
per 10 6 starch granules versus nonplastid marker-enzyme 
activity per 10 6 starch granules. From this analysis we 
determined that 71%, 77%, and 58% of the putative amy- 
loplast marker enzymes SBE, SS, and APase, respectively, 
were associated with the amyloplasts (Table I). The quan- 



tity of AGPase associated with the amyloplasts varied de- 
pending on the nonplastid marker used for the plot, but it 
is clear that little if any AGPase was recovered with the 
amyloplasts. These results are based on enzyme activities 
readily extracted in aqueous buffer solutions and are not 
expected to include the more tightly bound starch-granule- 
associated starch-synthase I and SBE II reported by Mu- 
Forster et al. (1996). 

We intended to use the TCE/heptane procedure to esti- 
mate enzyme compartmentation in endosperm samples 
from the btl mutant genotype. However, a critical part of 
the determination is an accurate count of the number of 
starch granules. We found that some of the TCE/heptane 
fractions contained a mixture of small starch granules 
(about 2 jinn) and very small starch granules (less than 1 
/jim). Even though we stained the samples with iodine, we 
were unable to distinguish the smallest starch granules 
from protein bodies. In addition, the compartmentation 
calculation assumes that each amyloplast contains a single 
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Table I. Compartmentation of enzymes in amyloplasts from W64A endosperm as determined using the TCE/heptane fractionation method 

SBE, SS, APase, and AGPase activities per million starch granules (/axis) in the TCE/heptane fractions were individually plotted against the 
activities per million starch granules of SUS, UGPase, ADH, and a-mannosidase (a-Mann), the nonplastidial marker enzymes. Estimates of SBE, 
SS, APase, and AGPase activities in amyloplasts per million starch granules were determined from the y intercept of a simple regression line from 
each individual plot. Activities of SBE, SS, APase, and AGPase per million starch granules in the total homogenate (Homo) are included. Data 
from three separate TCE/heptane fractionations were plotted. 
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starch granule. This is true for the normal inbred W64A 
and all maize endosperm mutant genotypes in the W64A 
background examined to date except for sul (Shannon and 
Garwood, 1984). However, the presence of the very small 
starch granules in the btl samples caused us to question 
this assumption and we prepared fresh btl endosperm 
samples for transmission electron microscopic examina- 
tion. From this examination it is clear that endosperm cells 
from 20-DPP btl kernels contain two populations of amy- 
loplasts: simple amyloplasts with a single starch granule 1 
to 5 jLim in diameter, and compound amyloplasts, contain- 
ing several very small starch granules (1 /im or less) (Fig. 
2). Therefore, we were unable to accurately estimate en- 
zyme compartmentation in btl endosperm cells by the 
TCE/heptane procedure. 

Nonaqueous Glycerol Isolation 

The recovery of enzyme activity after the nonaqueous 
glycerol fractionation procedure varied between 40% and 
117% of the enzyme activities measured after extraction in 
the HSB buffer (Table II). The glycerol-isolated amyloplast 
pellets contained only 8% and 7% of the cytosol marker 
enzymes, ADH and UGPase, respectively, and 14% of the 
AGPase activity (Table II). Thus, if we assume that the 
glycerol-isolated amyloplast pellet contains 7% cytosol 
contamination, then 7% of the cellular AGPase was com- 
partmented in the amyloplasts. In contrast, 95%, 79%, and 
38% of the recovered activities of the amyloplast marker 
enzymes SBE, SS, and APase partitioned with the glycerol- 
isolated amyloplasts, respectively. It is important to note 
that although the sum of APase activities in the glycerol 
supernatant and pellet fractions was 17% higher than in the 
HSB extract, only 56%, 40%, and 60% of the HSB- 
extractable activities of AGPase, SBE, and SS, respectively, 
were recovered in the glycerol-supernatant-plus-pellet 
fractions. The aliquots of HSB-diluted glycerol supernatant 
needed for assay of ADH, UGPase, and AGPase were 
smaller than those needed for assay of the plastid enzymes 
SBE and SS. 



In a separate study we determined that the 0.3% to 1.5% 
of glycerol carried over from the diluted glycerol superna- 
tant into the reaction mixtures was not inhibitory to ADH, 
UGPase, and AGPase, but that the 6% glycerol carried over 
into the assay mixtures for SBE and SS reduced measurable 
SBE and SS activities by approximately 60% and 25%, 
respectively, compared with assays in the absence of glyc- 
erol (data not shown). Thus, glycerol inhibition in the 
glycerol supernatants may contribute to the low recoveries 
of SBE and SS activities and inflate apparent partitioning of 
these enzymes in the glycerol-pellet fraction. When the 
percentage recoveries of the amyloplast marker enzymes 
associated with the glycerol pellets were calculated as per- 
centages of the activity in the HSB extracts, we estimated 
that the glycerol pellets contained 38%, 47%, and 46% of the 
total cellular SBE, SS, and APase, respectively. 

Activities of AGPase, SBE, SS, and APase per million 
starch granules in the glycerol-isolated amyloplasts (Table 
II) were very similar to the enzyme activities associated 
with the amyloplasts, as estimated by the TCE/heptane- 
fractionation procedure (Table I). It is clear from the results 
of these studies that a much higher percentage of AGPase 
partitions in the cytosol fraction compared with the amy- 
loplast marker enzymes. In addition, we have shown that 
the nonaqueous glycerol procedure may be used to isolate 
starch-granule preparations from kernels in mid develop- 
ment (20 DPP), which contain almost half of the soluble 
stromal enzymes but are relatively free of cytosol marker 
enzymes. 



Aqueous Fractionation and Immunolocalization 

Preparation of Amyloplasts from Developing 
Maize Endosperm 

As a second approach to confirm the subcellular local- 
ization of AGPase in maize endosperm, we isolated intact 
amyloplasts from 13-DPP endosperm. Typical examples of 



1242 



Shannon et al. 



Plant Physiol. Vol. 117, 1998 





Figure 2. Low-magnification transmission electron photomicrographs of the parts of three cells in endosperm from a 20-DPP 
bt1 mutant kernel. The section in the top micrograph was not poststained and that in the bottom micrograph was poststained 
with uranyl acetate and lead citrate. The large starch granules in the simple amyloplasts show dark artifacts that formed 
during sectioning because of the hydration and folding of the thin slices of starch. The compound amyloplasts contain many 
small starch granules. Bars = 2 /urn. 



the yield and purity of the aqueously isolated amyloplasts 
are summarized in Table III. Partially purified amyloplasts 
recovered in the lOOg pellet (crude amyloplasts) contained 
approximately 31% of the amyloplast marker enzyme SBE 
and less than 1%, 4%, and 2% of the cytosol marker (ADH), 
the mitochondrial marker (Cyt c oxidase), and the ER 



marker (cyanide-insensitive NADH-Cyt c reductase) en- 
zymes, respectively. Purification of the amyloplasts 
through a Percoll density gradient effectively removed all 
of the cytosol, mitochondria, and ER contaminants, but 
only 13.9% of the ex tractable SBE was retained with the 
more highly purified amyloplasts. Therefore, during Per- 



Brittle-"! Transfers ADP-GIc into Maize Endosperm Amylopiasts 



1243 



Table II. Partitioning of cytosol and amyloplast marker enzymes and ACPase in glycerol supernatant and pellet fractions 

Freeze-dried 20-DPP W64A endosperm tissues were pulverized and sifted through a 20-/^m sieve. Samples of sifted endosperm were 
homogenized in glycerol (Cly) and separated into supernatant and pellet fractions. The activities in these fractions were compared with the total 
activities in subsamples of the sifted endosperm extracted in HSB buffer. Activities are presented per 50 mg of sifted endosperm sample and per 
million starch granules. Data are the average ± se of the number of fractionations (shown in parentheses). 
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UGPase (4) 


HSB 






7935 ± 1860 






71.49 




Gly-HSB 


6457 ± 1009 


483 ± 171 


6940 ± 1055 


87.5 


7.0 ± 2.5 


4.76 


AGPase (5) 


HSB 






649.5 ± 108.7 






5.85 




Gly-HSB 


309.4 ± 72.4 


51.2 ± 17.9 


360.6 ± 88.3 


55.5 


14.2 ± 1.9 


0.50 


SBE (4) 


HSB 






6716 ± 920 






60.51 




Gly-HSB 


130 ± 59 


2571 ± 187 


2701 ± 228 


40.2 


95.2 ± 1.8 


25.33 


SS(4) 


HSB 






36.3 ± 6.5 






0.33 




Gly-HSB 


4.5 ± 0.6 


17.2 ± 2.3 


21.7 ± 2.9 


59.8 


79.3 ± 1.1 


0.17 


APase (6) 


HSB 






221.7 ± 39.2 






1.99 




Gly-HSB 


162.0 ± 32.3 


98.4 ± 16.5 


260.4 ± 44.3 


117.5 


37.9 ± 3.8 


1.01 



coll purification many of the amylopiasts were ruptured, 
releasing SBE from the amylopiasts. The crude and puri- 
fied amyloplast preparations contained 8.3% and 3.4% of 
the homogenate protein, respectively, and the specific ac- 
tivity of SBE relative to that in the homogenate was en- 
riched 3.7- and 4.1 -fold in the crude and PercoU-purified 
amyloplast fractions, respectively (Table III). 

Other cellular components such as the microbodies, 
plasma membrane, Golgi, and tonoplast cosedimented 
with the crude amyloplast preparation, resulting in a 2.2- to 
5.5-fold increase in specific activity of these marker en- 
zymes. However, after Percoll purification the amyloplast 
fraction was essentially free of catalase, the microbody 
marker, and contained only 0.6% of the cellular vanadate- 
sensitive ATPase, one of the markers for plasma mem- 



branes (Table HI). The purified amylopiasts contained 5.9% 
of the cellular potassium-stimulated ATPase, a second pu- 
tative plasma-membrane marker, but the specific activity 
had declined from 0.1 to 0.04. Likewise, the percentages of 
Triton-stimulated UDPase, a Golgi marker, and nitrate- 
sensitive ATPase, a tonoplast marker, were reduced to 
4.2% and 3.5%, respectively, and their specific activities 
were much lower than in the crude pellet (Table III). 

Membranes isolated from Percoll-purified amylopiasts 
were very yellow, with an absorption spectrum character- 
istic of carotenoids (plastid membrane marker): absorption 
peaks at 458 and 488 nm (data not shown). This membrane 
fraction was much enriched in the amyloplast membrane- 
specific polypeptide BT1 (Cao et al., 1995) compared with 
the total microsomal membranes (Fig. 3). 



Table III. Yield and purity of amylopiasts isolated from developing maize endosperm 

Amylopiasts were isolated from 13-DPP endosperm from cv Pioneer 3780 kernels. Aliquots were assayed for protein and marker enzymes after 

filtration through Miracloth (Homogenate), the first lOOg centrifugation pellet (Crude Amylopiasts), and the lOOg Percoll density-gradient- 

centrifugation pellet (Purified Amylopiasts). AN samples were suspended in homogenization buffer and lysed by one freeze-and-thaw cycle before 

the starch was removed by centrifugation and the soluble protein content and marker enzyme activities were determined. Data are the average 

of two or three determinations. Homogenate protein is presented as milligrams per gram fresh weight and all enzyme results are presented as 

nanomoles per minute per gram fresh weight. Enzyme activities in the crude and purified amylopiasts are presented as a percentage of the activity 

in the homogenate (% of Homo) and as specific activity (Spec Act) nanomoles per minute per milligram protein. 

Homogenate Crude Amylopiasts Purified Amylopiasts 
Marker Enzyme Compartment 







Activity 


Spec Act 


% of Homo 


Spec Act 


% of Homo 


Spec Act 


Protein 




3,480 




8.3 




3.40 




SBE 


Amyloplast 


1,800 


0.517 


31.1 


1.931 


13.90 


2.137 


ADH 


Cytosol 


3,913 


1.124 


0.7 


0.090 


0 


0 


Cyt c oxidase 


Mitochondria 


69 


0.020 


3.6 


0.009 


0 


0 


NADH Cyt c reductase 3 


ER 


1,113 


0.320 


1.7 


0.064 


0.02 


0.002 


Catalase 


Microbodies 


81,300 


23.362 


18.5 


51.724 


0.04 


0.256 


Vanadate-sensitive ATPase 


Plasma membrane 


143 


0.041 


18.3 


0.090 


0.60 


0.008 


Potassium-stimulated ATPase 


Plasma membrane 


74 


0.021 


39.9 


0.102 


5.90 


0.038 


Triton X-1 00-stimulated UDPase b 


Golgi 


121 


0.035 


46.6 


0.194 


4.20 


0.044 


Nitrate-sensitive ATPase 


Tonoplast 


76 


0.022 


33.6 


0.088 


3.50 


0.023 



a Cyanide-insensitive activity. b Color reagent without 1 .5% SDS was used. 
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Figure 3. Immunolocalization of BT1 polypeptides in maize amylo- 
plast membranes (lane 1), in microsomal membranes (P100) from the 
embryo, pericarp, and endosperm tissues (lanes 2, 3, and 6, respec- 
tively), and in pellets forming at 2,000g(P2) and 1 0,000g (PI 0) (lanes 
4 and 5, respectively). The amyloplast membranes were isolated from 
endosperm amyloplasts purified from immature (approximately 
1 2-1 5 DPP) Doebler 66XP hybrid kernels, and the P2, PI 0, and P1 00 
fractions were isolated from 13-DPP Pioneer 3780 hybrid kernels as 
described previously (Cao et al., 1 995). Polypeptides were separated 
by SDS-PACE (15% separating gel), transferred to a nitrocellulose 
filter, and probed with polyclonal antibodies raised against a fusion 
protein containing 56 amino acids of the C terminus of BT1 and 
glutathione 5-transferase (Sullivan and Kaneko, 1995). Lane 1 con- 
tained 8 /i,g of amyloplast-membrane protein; all other lanes con- 
tained 30 fig of protein. The relative quantities of BT1 in the various 
lanes as shown on the figure were determined by densitometry. 

Immunolocalization of the BT2- and SH2-Antibody- 
Reacting Polypeptides of AGPase 

Because the enrichment of amyloplasts based on the 
specific activity of SBE in the crude and Percoll-purified 
amyloplast preparations was similar but the yield of the 
amyloplast marker in crude amyloplasts was much higher 
than that in the purified amyloplast preparation, we chose 
crude amyloplasts for this experiment. Proteins from en- 
dosperm homogenates and crude amyloplasts were sepa- 
rated by 15% separating gel, transferred to nitrocellulose 
membranes, and probed with polyclonal antibodies raised 
against maize SH2 and BT2 polypeptides. The same-size 
polypeptides were detected in both the whole homogenate 
and the crude amyloplast preparations (Fig. 4). 

The most significant result was that when equal quanti- 
ties of protein were loaded, the intensities of SH2 or BT2 
antibody-reacting polypeprides(s) were not enriched in the 
proteins from the crude amyloplast fraction relative to 
those in the homogenate (Fig. 4). Rather, based on densi- 
tometer analyses we estimated that the levels of SH2 and 



BT2 polypeptides in the crude amyloplasts were about 
one- third and one-fifth of those in the homogenate, respec- 
tively (Fig. 4). This lack of enrichment in the crude amylo- 
plasts of the AGPase polypeptides was in sharp contrast to 
the approximately 4-fold enrichment of extractable SBE, 
the amyloplast stroma marker enzyme (Table III), and the 
10-fold enrichment of the amyloplast membrane marker 
BT1 in amyloplast membranes recovered from the Percoll- 
purified amyloplasts (Fig. 3). This lack of BT2 and SH2 
enrichment clearly indicates that the majority of the BT2- 
and SH2-antibody-reacting AGPase was localized outside 
of the amyloplasts. 

Because the yield of amyloplasts in the crude amyloplast 
preparation was 31%, based on SBE activity, and cytosol 
contamination was 0.7%, based on ADH activity, we esti- 
mated that 90% and 95.8% of the total SH2 and BT2 pro- 
teins, respectively, or an average of 93% of the cellular BT2- 
and SH2-antibody-reacting polypeptides of AGPase, were 
located in the cytosol (see "Materials and Methods" for 
calculations). These values are very close to the estimates of 
AGPase compartmentation determined by the TCE/hep- 
tane and glycerol-isolation procedures reported above (Ta- 
bles I and II). 

Metabolite Uptake and Incorporation into Starch by 
Isolated Amyloplasts 

Hexose-P Uptake and Incorporation 

Intact amyloplasts were aqueously isolated and purified 
from normal and mutant endosperms and their capacities 



Anti-BT2 
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100% 18% 
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Figure 4. Immunolocalization of AGPase in the homogenate and in 
the crude amyloplasts (100g pellet) from maize endosperm cells. 
Polypeptides were separated by SDS-PAGE (15% separating gel), 
transferred onto nitrocellulose membranes, and probed with poly- 
clonal antibodies raised against maize SH2 polypeptide (left) and 
maize BT2 polypeptide (right) (Giroux and Hannah, 1994). All lanes 
contained 12 ji,g of protein. Lanes 1 and 2 contained peptides from 
the homogenate and the crude amyloplasts, respectively. The relative 
quantity of SH2 and BT2 protein shown on the figure was determined 
by densitometry. 
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for the uptake and use of hexose-Ps and ADP-Glc for starch 
synthesis were determined. Sixty percent or more of the 
amyloplasts used for these studies were judged to be intact 
based on latency analysis (data not shown). Table IV sum- 
marizes the incorporation of [ 14 C]Glc into a methanol- 
insoluble product after incubation of intact or lysed amy- 
loplasts in uptake medium containing [ 14 C]Glc-l-P or 
[ 14 C]Glc-6-P either with or without added ATP. In the 
absence of added glycogen, very little [ 14 C]Glc from either 
hexose-P was incorporated into the methanol-insoluble 
product. Generally, Glc transfer from Glc-l-P was some- 
what higher than that from Glc-6-P, but there was little if 
any effect of added ATP, a substrate for AGPase. In addi- 
tion, incorporation by amyloplast from the AGPase- 
deficient mutant sh2 was equal to that by amyloplasts from 
normal, wx and btl endosperms. 

The similarity in incorporation of [ 14 C]Glc-l-P by all 
genotypes tested and the small effect of added ATP indi- 
cates that polymerization was most likely caused by the 
activity of starch phosphorylase, with little contribution by 
plasnd -localized AGPase. The standard uptake solution 
used in these studies contained relatively high levels of 
3-PGA (12.4 mM), the allosteric activator of AGPase. It is 
possible that the low incorporation of hexose-Ps was the 
result of 3-PGA inhibition of hexose-P uptake, but this was 
ruled out by a later study in which we showed that uptake 
and incorporation of hexose-Ps in the presence of 0.5 and 
12.4 mM 3-PGA were similar (Table IV). 

During incubation some of the amyloplasts are invari- 
ably ruptured, releasing plastid enzymes. The conse- 
quences of this were seen when glycogen, an alternative 
glucan acceptor, was added to the "intact" and lysed amy- 
loplasts incubated with Glc-l-P. Over 10 times more 
[ 14 C]Glc was incorporated into the methanol-insoluble 



polymer but, again, there was little if any effect of added 
ATP (Table IV). This supports the conclusion that the iso- 
lated amyloplasts contain an active starch phosphorylase 
that effectively transfers Glc from Glc-l-P to glycogen, but 
was much less effective in Glc transfer to the native glucan 
acceptors (starch granules) of the amyloplasts. Apparently, 
amyloplasts isolated from sh2 (the only genotype tested) 
contain very little phosphoglucomutase, or that which is 
present is essentially inactive in the uptake conditions 
used, because even in the presence of added glycogen there 
was very little transfer of [ 14 C]Glc from Glc-6-P to the 
methanol-insoluble product (Table IV). 

ADP-Glc Uptake and Incorporation 

Intact maize amyloplasts isolated from 10- to 16-DPP 
normal, wx, and sh2 endosperms incorporated more than 
10 times as much Glc from ADP-Glc into a methanol- and 
water-insoluble product (Table V) as from Glc-l-P (Table 
IV). This difference in incorporation was not caused by a 
difference in the buffer salt or pH of the uptake solutions 
used in the standard conditions, because in a later study we 
determined that there was no difference in Glc incorpora- 
tion from ADP-Glc when the uptake solution was buffered 
with Hepes at pH 7.5 rather than at pH 8.5 (Table V). 
Hydrolysis of the water-insoluble product with ^-amylase 
yielded maltose and the product was judged to be starch 
(data not shown). Lysis of the amyloplasts before incuba- 
tion reduced incorporation 70% to 90%. Among the three 
genotypes, intact amyloplasts from sh2 most effectively 
converted ADP-Glc to starch. In contrast, intact amylo- 
plasts from the other starch-deficient/high-sugar geno- 
type, btl, was only 26% as effective in the uptake and 
conversion of ADP-Glc to starch as amyloplasts from wx, 



Table IV. Hexose-P uptake and incorporation into methanol-insoluble product 

Amyloplasts were from the 20% Percoll fraction. The reaction mixture (200 /il final volume) contained 15 mM Hepes, pH 7.5, 0.08% BSA, 
10 mM MgCI 2 , 12.4 mM 3-PGA, 0.5 M sorbitol, 0.1 unit of inorganic pyrophosphatase, 2 mM Glc-1-P or Glc-6-P, 2 mM ATP, and .1 mg of 
rabbit-liver glycogen (RLG) as indicated plus 60 or 80 fxl of amyloplast fraction. For the uptake studies amyloplasts were isolated from the 
endosperms of the following age kernels: N, 12 DPP; wx, two 12 DPP, one 13 DPP, and one 1 6 DPP for the minus-RLG study, and one 13 DPP 

and one 16 DPP for the plus-RLG study; sh2, one each of 13, 15, and 16 DPP; and btl, one 11 DPP. 

Total Incorporation 

Glc-l-P Glc-6-P 

Genotype (n) 

Intact Lysed Intact Lysed 



•f ATP -ATP +ATP -ATP +ATP -ATP +ATP -ATP 

pmot min~ 1 10~ 6 amyloplasts 



No RLG 


























N (1) 


4.17 


1.83 


2.08 




















wx (4) 


4.54 ± 0.50 


3.90 ± 1.42 


4.12 ± 1.22 








1.96 ± 1.37 


1.25 ± 0.42 


1.08 








wx(1) a 


6.50 




1.36 








7.11 




1.12 








sh2 (3) 


5.11 ± 0.58 


3.87 ± 1.67 


2.81 ± 0.75 


1.52 


± 


1.00 


0.83 ± 0.14 


0.60 ± 0.21 


0.44 ±0.12 


0.72 


± 


0.68 


btl (1) 


5.67 


4.83 


4.50 








6.08 


1.92 


1.58 








Plus RLG 


























wx(2) 


43.75 ± 6.1 


37.78 ± 7.3 


63.94 ±0.7 


58.49 




1.1 














sh2 (2) 






47.56 ± 9.1 


42.93 




9.1 






2.49 ± 0.7 


2.02 




0.6 



a For this experiment the uptake solution contained 0.5 mM 3-PGA rather than the 12.4 mM used for all of the others. 
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Table V. ADP-Glc uptake and incorporation into methanol- and water-insoluble products 

Amyloplasts were from the 20% Percoll fraction. The reaction mixture (200 pi final volume) contained 100 mM Bicine, pH 8.5, 0.5 m sorbitol, 
12.5 mM EDTA, 10 mM GSH, 50 mM potassium acetate, and 4 mM ( 14 CjADP-Glc plus 60 or 80 pi of amyloplast fraction. For the uptake studies 
amyloplasts were isolated from the endosperms of the following age kernels: N, two 10 DPP, one 11 DPP, and two 12 DPP; wx, one 10 DPP, 
three 12 DPP, five 13 DPP, and one 14 DPP; $h2, one each of 15 and 16 DPP; and bt1, one each of 1 1, 12, and 14 DPP. Data are averages ± SE. 



Genotype (n) 


Total Incorporation 


Percentage Water Insoluble 


intact 


Lysed 


Intact 


Lysed 




pmol min~ 1 


10~ 6 amyloplasts 


% of total methanol insoluble 


N(5) 


58.58 ± 24.5 


12.33 ± 7.75 


90 ± 6 


87 ±8 


wx(10) 


67.25 ± 16.83 


6.17 ± 4.00 


79 ± 6 


80 ± 11 


wx(1) a 


49.37 


6.29 


94 


95 


sh2 (2) 


133.75 ± 8.42 


42.92 ± 3.67 


86 ± 10 


92 ± 4 


bt1 (3) 


18.00 ± 6.17 


12.33 ± 4.17 


67 ±4 


87 ± 1 



a For this experiment the uptake solution contained 1 5 mM Hepes, pH 7.5, 0.08% BSA, 10 mM MgCI 2 , 0.5 m sorbitol, and 4 mM ADP-Glc plus 
80 pi of amyloplast suspension. 



but incorporation by lysed btl amyloplasts was similar to 
that by lysed normal and wx amyloplasts. 

The addition of glycogen, an alternative glucosyl accep- 
tor, to the reactions containing the lysed amyloplasts from 
wx and btl genotypes restored incorporation equal to or 
greater than that in reactions containing intact wx amylo- 
plasts (Table VI). Thus, the reduced [ 14 C]Glc incorporation 
from ADP-Glc into starch by btl amyloplasts was caused 
by the reduced transfer of ADP-Glc into the amyloplasts. In 
addition, we can conclude from these results that the re- 
duced incorporation from ADP-Glc by lysed amyloplasts 
in the absence of added glycogen acceptor was apparently 
the result of dilution of the SS relative to the nonreducing 
ends of the native maltooligosaccharide or starch-granule 
acceptors. Enhanced incorporation of [ 14 C]Glc from ADP- 
Glc in the presence of rabbit-liver glycogen (Table VI) 
indicates that glycogen may be a better substrate for SS 
than the native acceptors. 

If an adenylate translocator in the amyloplast membrane 
of the isolated intact amyloplasts is functioning in the 
uptake of ADP-Glc in exchange for ADP or AMP, then 
these ADP or AMP nucleotides in the uptake medium 
might compete with ADP-Glc for uptake. In fact, preincu- 
bation of the amyloplasts in the cold for 30 min in the 
presence of 7 mM ADP or AMP, followed by [ 14 C]ADP-Glc 
uptake and incorporation in the presence of 2.2 mM ADP 
or AMP, reduced 14 C incorporation into starch by 75% and 
82%, respectively (data not shown). SS synthesis activities, 
as measured in reactions containing lysed amyloplasts 
plus glycogen, were inhibited 60% and 63%, respectively, 
by these ADP and AMP treatments. Therefore, in this study 
we were unable to distinguish between the effects of the 
nucleotides on ADP-Glc uptake and their effects on SS 
activity. 

The adenosine analog FSBA is well known to react with 
adenosine nucleotide-binding sites of enzymes and pro- 
teins (Colman, 1983), including mitochondrial F 1 -ATPase 
(Esch and Allison, 1978), chloroplast ATPase (DeBenedetti 
and Jagendorf, 1979), and an ADP-binding protein on the 
exterior surface of human platelets (for review, see Col- 
man, 1983). If ADP-Glc is transported into amyloplasts via 
an adenylate translocator, then we predicted that FSBA 



would inhibit ADP-Glc uptake into intact amyloplasts, re- 
sulting in reduced incorporation of Glc into starch. To test 
this we pretreated intact amyloplasts from wx in the stan- 
dard uptake medium containing 0 to 4 mM FSBA dissolved 
in DMSO. All pretreatment and uptake solutions contained 
2% DMSO (the FSBA solvent), which had no negative effect 
on uptake of ADP-Glc and incorporation of [ 14 C]Glc into 
the methanol-insoluble product in either the absence or 
presence of rabbit-liver glycogen (incorporation in the 
presence of glycogen is a measure of SS activity) (Table 

vn). 

Inhibition of uptake and incorporation by intact amylo- 
plasts increased with increasing concentrations of FSBA. 
FSBA would also be expected to interact with the ADP-Glc- 
binding site of SS, but at 2 and 4 mM FSBA, reduction in 
uptake and incorporation into starch by intact amyloplasts 
was greater than the inhibition of SS as measured in lysed 
Table VII amyloplasts in the presence of added glycogen 
(Table VII). For intact amyloplasts incubated in the absence 
of glycogen (except at the highest FSBA treatment), almost 
three-fourths of the methanol-insoluble radioactivity was 
incorporated into the water-insoluble starch granules. The 
amyloplast-uptake studies provide evidence that cytosolic 
synthesized ADP-Glc can be transferred across amyloplast 
membranes via an adenylate translocator. We have pro- 
posed that BT1 is that adenylate translocator in maize 
endosperms (Shannon et al., 1996). 



Table VI. Effect of glycogen on ADP-Glc incorporation into meth- 
anol-insoluble products 

Amyloplast preparation and uptake conditions were the same as in 
Table V, with the addition of 1 mg of rabbit-liver glycogen (RLG) 
where noted. For each genotype, data are the means of duplicate 
incubations of a single preparation of amyloplasts from 13 DPP wx 
endosperm and from 14 DPP btl endosperm. 



Genotype 




Total Incorporation 


Intact 


Lysed Lysed + RLG 






pmol min~ 1 10~ 6 amyloplasts 


wx 


78.2 


13.6 110.3 


btl 


26.5 


18.1 130.9 
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Table VII. FSBA inhibition of ADP-GIc uptake and incorporation into methanol- and water- insoluble products and inhibition of SS 

Amyloplasts in the 20% Percol! fraction isolated from 13 DPP \vx endosperms were used. The amyloplasts were preincubated for 30 min at 
30°C in their respective reaction mixtures before the addition of I 1 4 C) ADP-GIc. The reaction mixtures were as described in Table VI, with the 
addition of 2% DMSO and FSB A as noted. Reaction mixtures containing 1 mg of rabbiMiver glycogen (RLG) provide a measure of SS activity. 
Control intact and lysed amyloplasts, without DMSO, incorporated 34.62 and 4.63 pmol min' 1 10~ 6 amyloplasts, respectively. 



FSBA 


RLG 


Total Incorporation of 
[ 14 C)ADP-Glc 


Inhibition of incorporation 
by FSBA 


Incorporated into Starch 
Granules 


Intact Lysed 


Intact 


Lysed 


Intact 


Lysed 


/TIM 




pmol min~ ' W 6 amyloplasts 






% 




0 




48.25 5.67 


0 


0 


76 


92 


1 






6 


0 


68 


97 


2 






43 


33 


69 


67 


4 






62 


30 


53 


54 


0 


+ 


57.92 59.08 


0 


0 


27 


8 


1 


+ 




0 


9 


37 


10 


2 


+ 




3 


16 


30 


8 


4 


+ 




20 


41 


25 


10 



Identification of a Putative ADP-Glc-Binding Motif in BT1 

If BT1 is the adenylate translocator functioning in the 
transfer of ADP-GIc into amyloplasts, then BT1 must con- 
tain an ADP-Glc-binding motif. Analysis of the full-length 
BT1 sequence showed the presence of a KTGGL motif. This 
motif was identified as the ADP-Glc-binding site of Esche- 
richia coli glycogen synthase (Furukawa et al., 1993) and is 
conserved in all known enzymes that use ADP-GIc as a 
substrate, including plant SS and bacterial glycogen syn- 
thases (Table VIII). The KTGGL motif in BT1 is 40 amino 



acid residues upstream of the transit-peptide cleavage site 
proposed by Sullivan et al. (1991). Thus, if this proposed 
ADP-Glc-binding motif is present in the mature BT1 pro- 
tein, then an alternative transit-peptide cleavage site is 
required. Comparison of several known N-terminal se- 
quences of SS revealed a consensus cleavage site of V(I)X/ 
A(G,S), and in BT1 an alternative cleavage site, VP /A, is 
present 13 amino acid residues upstream of the KTGGL 
motif, the proposed ADP-Glc-binding site (Table IX). 
Cleavage at this site would yield a mature BT1 protein of 44 



Table VIII. Positions of ADP-Glc-binding motifs in the full-length BT1 protein and in various starch (SS) and glycogen synthases (GS) 



Protein Accession No. Motif Position Reference 



Maize BT1 


M79333 


KTGGL 
V 


35-39 


Sullivan et al. (1991) 


ADP-Glc-binding 




A 

S 






consensus sequence 




KTGGL 






Barley WX 


X07932 


KTGGL 


90-94 


Rohde et al. (1988) 


Cassava WX 


X74160 


KTGGL 


96-100 


Salehuzzaman et al. (1993) 


Maize WX 


M24258 


KTGGL 


90-94 


Kioesgen et al. (1986) 


Pea GBSSI 


X88789 


KTGGL 


91-95 


Dry et al. (1992) 


Pea GBSSII 


X88790 


KTGGL 


255-259 


Dry et al. (1992) 


Potato SS 


X87988 


KTGGL 


360-364 


Edwards et al. (1995) 


Potato SSIII 


X94440 


KVGGL 


794-798 


Abel et al. (1996) 


Potato SSSI 


Y10416 


KTGGL 


145-149 


(G.I.W. Abel, J. Kossman, and L. 










Willmitzer, unpublished data) 


Potato SSSI II 


X95759 


KVGGL 


794-798 


Marshall et al. (1996) 


Potato WX 


X58453 


KTGGL 


95-99 


van der Leij et al. (1991) 


Rice SSS 


D16202 


KSGGL 


97-101 


Baba et al. (1993) 


Rice WX 


X62134 


KTGGL 


97-101 


Okagaki (1992) 


Sorghum WX 


U23945 


KTGGL 


96-100 


Hsieh et al. (1996) 


Sweet potato SS 


U44126 


KTGGL 


98-102 


(S.-J. Wang, K.W. Yeh, and C.-Y. 










Tsai, unpublished data) 


Wheat WX 


X57233 


KTGGL 


91-95 


Clark et al. (1991) 


£ coli CS 


J02616 


KTGGL 


15-19 


Kumar et al. (1986) 


Bacillus stearothermophilus CS 


D87026 


KSGGL 


15-19 


(H. Takata, T. Takata, S. Okada, 










M. Takagi, and T. Imanaka, 










unpublished data) 


Bacillus subtilis GS 


Z25795 


KSGGL 


15-19 


Kiel etal. (1994) 


Agrobacterium tumefaciens GS 


L24117 


KTGGL 


15-19 


Uttaro and Ugalde (1994) 


Synechocystis GS 


D90899 


KAGGL 


15-19 


Kaneko etal. (1996) 
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Table IX. Positions of putative transit peptide cleavage sites in the full-length BT1 protein and in various starch synthases (SS) 



Protein Accession No. Cleavage Site Method Position Ref. 



Maize 8T1 


M79333 


SLQVP/AV 
S 


Sequence alignment 


24/25 


Sullivan et al. (1991) 


Transit peptide cleavage site 




I G 








Consensus sequence 




VX/A 








Barley WX 


X07932 


SWVS/AT 


Sequence alignment 


70/71 


Rohdeetal. (1988) 


Cassava WX 


X74160 


AKHC/GH 


Sequence alignment 


78/79 


Salehuzzaman et al. 












(1993) 


Maize WX 


M24258 


SLWC/AS 


Protein sequencing 


72/73 


Kloesgen et al. (1986) 


Pea CBSSI 


X88789 


GKIVC/GM 


Protein sequencing 


75/76 


Dry etal. (1992) 


Pea GBSSII 


X88790 


KQHVR/AV 


Protein sequencing 


57/58 


Dry et ai. (1992) 


Potato SS 


X87988 


NORVK/AT 


Protein sequencing 


65/66 


Edwards et al. (1995) 


Potato WX 


X58453 


ATIVC/GK 


Protein sequencing 


77/78 


van der Leij et al. (1991) 


Rice SSS 


D16202 


TIFVA/SE 


Protein sequencing 


24/25 


Baba et al. (1993) 


Rice WX 


X62134 


SVWY/AT 


Protein sequencing 


77/78 


Okagaki (1992) 


Sorghum WX 


U23945 


SLWC/AT 


Sequence alignment 


77/78 


Hsiehetal. (1996) 


Wheat WX 


X57233 


SMVVR/AT 


Protein sequencing 


70/71 


Clark et al. (1991) 



kD, which agrees well with the 39 to 44 kD for BT1 reported 
previously (Cao et al., 1995). 

DISCUSSION 

In this paper we summarize the results of several studies 
that strongly support the conclusion that in maize en- 
dosperm most of the cellular AGPase is localized in the 
cytosol, and that the inner amyloplast-membrane-specific 
polypeptide, BT1, is an adenylate translocator that func- 
tions in the transfer of cytosol-synthesized ADP-Glc into 
the amy lop lasts. Denyer et al. (1996) reported that more 
than 95% of AGPase activity in maize endosperm cells is 
extraplastidial. This result was based on aqueous fraction- 
ation of endosperm homogenates from young (11-17 DPP) 
kernels. During fractionation approximately 75% of the 
amyloplast marker enzymes were lost from the amyloplast 
fraction (Denyer et al., 1996), and we know from experience 
that a greater proportion of the more mature amyloplasts 
with the larger starch granules are lysed during isolation. 
Consequently, the final preparation would be enriched 
with amyloplasts containing small starch granules. En- 
zyme compartmentation in such an amyloplast preparation 
may not be representative of compartmentation in the 
more mature amyloplasts from cells in the linear phase of 
starch accumulation. 

This concern is validated by a recent immunolocalization 
study by Brangeon et al. (1997), which clearly shows that 
the peripheral endosperm cells were only lightly immuno- 
labeled by antibodies to BT2 and SH2 and that there was a 
gradient of increasing signal intensity that paralleled the 
increase in starch-granule size. We have developed and 
used nonaqueous TCE /heptane fractionation and non- 
aqueous glycerol-isolation methods to show for the first 
time, to our knowledge, that in the more mature maize 
endosperm cells (20 DPP), 90% or more of the cellular 
AGPase is cytosolic (Tables I and II). These studies were 
based on the observation that during freeze-drying, much 
of the amyloplast stromal content dries onto the surface of 
the starch granule, and the starch granule also shrinks 



away from the cytosol (Liu and Shannon, 1981). Therefore, 
during nonaqueous fractionation (Table I) or isolation (Ta- 
ble II) the stromal enzymes associated with the starch 
granules remain with the granules until they are extracted 
with the aqueous buffer solution. 

With the TCE /heptane protocol, patterned after the pro- 
cedures used by Riens et al. (1991) and MacDougall et al. 
(1995), pulverized endosperm samples were separated into 
several fractions with varying enrichments in amyloplasts 
or cytosol. With this procedure enzyme compartmentation 
in amyloplasts was determined from plots of the activity of 
the enzyme in question per million starch granules against 
the activity of nonplastidial enzymes per million granules. 
This procedure has the advantage that enzyme recovery 
was high after TCE /heptane fractionation. It was clear from 
these data that AGPase closely partitioned with the nonplas- 
tidial marker enzymes (Fig. 1), and 58% to 77% of the plastid 
marker-enzyme activities were retained with the starch 
granules (Table I). 

Although the TCE /heptane procedure could be used to 
estimate enzyme compartmentation in amyloplasts, it was 
not satisfactory for the isolation of a cytosol-free starch- 
granule preparation with associated stromal enzymes 
(amyloplasts). To accomplish this we modified the 
glycerol-based procedure of Liu and Shannon (1981). The 
resulting starch-granule preparations, which were contam- 
inated with about 7% of the cytosol marker enzymes, re- 
tained approximately 14% of the cellular AGPase (7% more 
than cytosolic contamination) and 50% of the amyloplast 
marker-enzyme activities (Table II). It is significant that 
enzyme activities per million starch granules determined 
by both nonaqueous procedures were approximately the 
same (Tables I and II). Results of the nonaqueous studies 
that demonstrated predominant cytosolic localization 
of AGPase were confirmed by immunolocalization of BT2 
and SH2 polypeptides in aqueously isolated amyloplasts 
(Fig. 4). 

Earlier studies of compartmentation of AGPase in maize 
endosperm cells have been controversial. In contrast to the 
extra-amyloplastic localization of AGPase reported by 
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Denyer et aL (1996), results of immunocytolocalization 
studies have been interpreted as showing that most, if not 
all, AGPase is localized in the amyloplasts (Miller and 
Chourey, 1995; Brangeon et al., 1997). Both approaches to 
the study of compartmentation in maize endosperm have 
serious disadvantages. First, as noted above, 75% or more 
of the amyloplasts are ruptured during aqueous isolation, 
resulting in an amyloplast preparation enriched in plastids 
containing the smaller starch granules (Shannon, 1987). 
Second, a serious drawback of immunocytolocalization 
studies of maize endosperm at the electron-microscopic 
level is the difficulty of sufficiently embedding the tissue so 
that the thin slices of starch granules remain in the plastic. 
Consequently, regions of the endosperm consisting of cells 
with small starch granules are more likely to survive prep- 
aration. In maize endosperms such cells occur in very 
young kernels (about 12 DPP) or in the peripheral cells. 

As noted above, enzyme compartmentation in these 
physiologically less mature cells may not be representative 
of compartmentation in cells more actively engaged in 
starch synthesis (Tsai et al., 1970; Brangeon et al., 1997). In 
addition, it is possible that cytosolic enzymes may be lost 
from the tissue piece during preparation of the samples for 
electron-microscopic examination; in fact, Miller and 
Chourey (1995) pointed out that they were unable to im- 
munolocalize the cytosol-specific enzyme SUS. Tissue 
preparation and cutting of the thicker sections suitable for 
immunocytolocalization at the light-microscopic level are 
less problematic than preparation for electron-microscopic 
studies. In a light-microscopic immunolocalization study 
of AGPase compartmentation in maize kernels, Brangeon 
et al. (1997) clearly showed that in pericarp cells of kernels 
8 DPP, polypeptides recognized by antibodies to the 
AGPase subunits BT2 and SH2 were cytosolic, but in 
endosperm cells actively engaged in starch synthesis 
(15 DPP), the antibodies immunolabeled only polypeptides 
that were closely associated with amyloplasts. These 
authors suggested an intraplastidial localization for the 
AGPase polypeptides encoded by Bt2 and Sh2 in these 
maize endosperm cells. However, they suggest that at this 
level of resolution, it is not possible to distinguish between 
proteins in the amyloplast stroma and proteins located 
either between the inner and outer membranes of the amy- 
loplast envelope or closely associated (loosely bound) with 
the outer membrane (Brangeon et aL, 1997). 

The compartmentation results obtained in the present 
study using nonaqueous procedures do not support 
the suggestion of Brangeon et al. (1997) that AGPase is 
localized in the amyloplast stroma. Rather, we show that 
AGPase resides in a compartment that partitions with the 
cytosol during nonaqueous fractionation. However, be- 
cause the soluble enzymes located within the inner- 
membrane space of the amyloplast envelopes and those in 
close association with the amyloplasts in situ would be 
expected to partition with the cytosol during aqueous and 
nonaqueous fractionation or isolation, we were unable to 
rule out the possibility that AGPase resides within the 
inner-membrane space of amyloplasts. 



BTl Is an Adenylate Translocator 

Two phosphate translocators (Fischer et al., 1997; Kam- 
merer et al., 1998) and two adenylate translocators (Mohl- 
mann et al., 1997) have been reported to be present in 
maize endosperm amyloplast membranes. Fischer et al. 
(1997) isolated and characterized a PEP/Pi antiporter that 
is present in plastid membranes from both photosynthetic 
and nonphotosynthetic tissues. The Glc-6-P/Pi antiporter 
was shown to be preferentially expressed in nonphotosyn- 
thetic tissues (Kammerer et al., 1998) and to mediate the 1:1 
exchange of Glc-6-P with Pi and triose phosphate, and is 
assumed to function in vivo in the import of Glc-6-P into 
amyloplasts. Kammerer et al. (1998) suggest that Glc-6-P 
may be used either in the starch biosynthetic pathway or as 
a substrate for the oxidative pentose-phosphate pathway. 
Neuhaus et al. (1993) reported the isolation of amyloplasts 
from maize endosperm that were capable of uptake and 
incorporation Glc-6-P into starch. Mohlmann et al. (1997) 
used a similar amyloplast-isolation procedure and deter- 
mined that Glc from ADP-Glc was incorporated into starch 
at a rate 6 times higher than that from Glc-6-P. 

The amyloplast-isolation procedure used for these stud- 
ies, which included multiple high-speed centrifugations 
through density gradients, yielded a preparation of amy- 
loplasts with very small starch granules (Neuhaus et al., 
1993) and most likely also contained amyloplast membrane 
vesicles without starch granules. We have used a much 
more gentle amyloplast-isolation procedure, and the re- 
sults of the hexose-P uptake and incorporation studies 
presented in this paper do not support the use of Glc-6-P in 
the starch biosynthetic pathway (Table IV). Intact amylo- 
plasts from maize endosperm were relatively inefficient in 
the uptake and conversion of Glc-l-P and Glc-6-P into 
starch regardless of whether ATP was included in the 
uptake medium. In this study we measured incorporation 
of radioactive hexoses into starch and did not attempt to 
determine hexose-P uptake independent of its utilization in 
starch synthesis. If we assume that the amyloplast mem- 
branes contain a functional Glc-6-P/Pi antiporter, then the 
imported Glc-6-P is a poor substrate for starch synthesis. 
The amyloplasts used for these uptake and incorporation 
studies were isolated from young kernels that may not 
have developed their full complement of AGPase activity 
(Tsai et al., 1970; Brangeon et al., 1997), and this may have 
contributed to the poor utilization of hexose-Ps. However, 
when considering the predominant cytosolic localization of 
AGPase (Table I and II), the poor hexose-P utilization may 
simply reflect the minor role of amyloplastic AGPase in 
starch synthesis. 

Amyloplasts isolated from wx and sh2 endosperms ap- 
parently do contain active starch phosphorylase, because 
when glycogen, an alternative glucan acceptor, was in- 
cluded in the uptake mixture Glc was effectively trans- 
ferred from Glc-l-P to the glycogen acceptor (Table IV). In 
contrast, Glc-6-P was a poor substrate for Glc addition to 
glycogen, indicating either that the amyloplasts from sh2 
contain very little phosphoglucomutase or that it is inactive 
in the incubation conditions used. 
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Results of in vivo studies of the starch-deficient maize 
endosperm mutant btl support the conclusion that BT1, an 
amyloplast-membrane-specific polypeptide (Cao et al., 
1995; Sullivan and Kaneko, 1995), is an adenylate translo- 
cator that functions in ADP-Glc transfer into amyloplasts. 
For example, ADP-Glc, which is synthesized by AGPase, 
accumulates in the endosperm of btl mutant kernels 
(Shannon et al, 1996). Activities of AGPase, UGPase, SS, 
extractable SBE, and SUS in extracts from btl mutant en- 
dosperms were equal to or greater than activities in en- 
dosperm extracts from normal kernels (Shannon et al., 
1996). The genetic lesion in btl kernels was found to be an 
amyloplast-membrane-specific, 39- to 44-kD polypeptide, 
BT1 (Cao et al., 1995; Sullivan and Kaneko, 1995). 

Based on these results, we suggest that BT1 is an ade- 
nylate translocator that functions in the transfer of ADP- 
Glc from the cytosol into the amyloplast, and in its absence 
(i.e. in btl mutant kernels) ADP-Glc accumulates (Shannon 
et al., 1996). The most direct support for this suggestion is 
provided by the marked difference in the uptake of ADP- 
Glc and its use for starch synthesis by amyloplasts isolated 
from btl endosperms and amyloplasts isolated from nor- 
mal, wx, and $h2 endosperms (Table V). Intact amyloplasts 
from btl endosperms, which are missing the BT1 polypep- 
tides, were only 26% as effective in taking up and convert- 
ing ADP-Glc to starch as those from the other genotypes 
(Table V). 

Several lines of evidence support the conclusion that we 
were measuring ADP-Glc uptake and utilization by intact 
amyloplasts and not simply synthesis by SS associated with 
granules released from lysed amyloplasts: (a) ADP-Glc 
incorporation by lysed amyloplasts was only about 10% of 
that by intact amyloplasts; (b) for many of the uptake and 
incorporation studies we used amyloplasts from the wx 
endosperm mutant, which is deficient in the starch- 
granule-bound starch synthase (Shannon and Garwood, 
1984); and (c) the adenosine analog FSB A, which is known 
to react with adenosine-binding sites (Colman, 1983), more 
effectively inhibited uptake and incorporation of ADP-Glc 
than starch synthase (Table VIII). 

Comparison of the translated full-length sequence of BT1 
with protein sequences of 45 adenylate translocators from 
20 species revealed about 30% identity and 81% similarity 
within the highly conserved regions of the mitochondrial 
adenylate translocators (Cao and Cao, 1997). If BT1 is an 
adenylate translocator, as was suggested, then the mature 
protein should contain an ADP-Glc-binding motif. How- 
ever, no ADP-Glc-binding motif was present in the mature 
BT1 protein, assuming that the transit-pep tide cleavage site 
VRA/A that was proposed by Sullivan et al. (1991) is 
correct. However, analysis of the full-length BT1 amino 
acid sequence showed the presence of the putative ADP- 
Glc-binding motif, KTGGL, 40 amino acid residues up- 
stream of the cleavage site proposed by Sullivan et al. 
(1991) (Table VHT). For BT1 we propose an alternative 
transit-pep tide cleavage site, VP/ A, 13 amino acids up- 
stream of the putative ADP-Glc-binding motif (Table IX). 
Transit-pep tide cleavage at this site would yield a mature 
BT1 of 44 kD, which agrees well with the size we reported 
for mature BT1 (Cao et al, 1995) but is somewhat larger 



than the 39.5- and 38.5-kD mature BT1 polypeptides re- 
ported by Li et al. (1992). 

In summary, we have provided evidence that most of the 
cellular AGPase in maize kernels in both the linear phase 
and in the early phase of starch accumulation resides in a 
compartment that partitions with cytosolic marker en- 
zymes after nonaqueous and aqueous fractionation. How- 
ever, based on the immunolocalization study of Brangeon 
et al. (1997), we suggest that in situ AGPase is functionally 
compartmented with the amyloplasts and may be loosely 
associated with the outer membrane of the amyloplast 
envelope. ADP-Glc is transported into the amyloplast 
stroma via BT1, which may be the same transporter as the 
ADP-Glc /AMP adenylate translocator described by Mohl- 
mann et al. (1997). The importance of the BT1 translocator 
to starch accumulation in maize endosperms is demon- 
strated by the severely reduced starch content in btl mu- 
tant kernels (Tobias et al., 1992). Assessment of the relative 
importance of the hexose-P/Pi antiporter for starch accu- 
mulation in vivo awaits isolation of a mutant genotype 
defective in the hexose-P/Pi antiporter. 
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Sequential rounds of error-prone PCR to introduce random 
mutations and screening of the resultant mutant libraries 
have been used to enhance the total catalytic activity of 
subtilisin E significantly in a non-natural environment, 
aqueous dimethylformamide (DMF). Seven DNA substitu- 
tions coding for three new amino acid substitutions were 
identified in a mutant isolated after two additional genera- 
tions of directed evolution carried out on 10M subtilisin E, 
previously 'evolved' to increase its specific activity in 
DMF. A Bacillus subtitis-Escherichia coli shuttle vector 
was developed in order to increase the size of the mutant 
library that could be established in B.subtilis and the 
stringency of the screening process was increased to reflect 
total as well as specific activity. This directed evolution 
approach has been extremely effective for improving 
enzyme activity in a non-natural environment: the 
resulting-evolved 13M subtilisin exhibits specific catalytic 
efficiency towards the hydrolysis of a peptide substrate 
succinyl-AIa-Ala-Pro-Phe-/7-nitroanilide in 60% DMF solu- 
tion that is three times that of the parent 10M and 471 
times that of wild type subtilisin E. The total activity of 
the 13M culture supernatant is enhanced 16-fold over that 
of the parent 10M. 

Keywords: B.subtilis/diiectcd evolution/subtilisin E/total activity/ 
random mutagenesis/organic solvents 



Introduction 

Available natural enzyme resources are being tailored to fulfil 
increasing demands for new biocatalysts. A practical strategy 
for altering enzyme properties is to introduce random base 
substitutions into the gene sequence and then select or screen 
for variants that express the desired phenotype(s). Features 
that have been enhanced by random mutagenesis include 
catalytic activity (Graham etai, 1993), activity in organic 
solvents (Chen and Arnold, 1993), thermostability (Bryan 
etai, 1986; Liao etai, 1986; Joyet etal. f 1992), alkaline 
stability (Cunningham and Wells, 1987) and substrate specifi- 
city (Oliphant and Struhl, 1989; Graham et al % 1993). Once the 
genes are sequenced, effective mutations can be accumulated by 
site-directed mutagenesis (Joyet etal. % 1992; Strausberg et a/., 
1995). An attractive alternative to sequencing and site-directed 
mutagenesis is to accumulate beneficial mutations in sequential 
rounds of random mutagenesis (Chen and Arnold, 1993) or 
by a novel recombination approach (Stemmer, 1994), following 
a 'directed evolution* strategy (Arnold, in press). Directed 



evolution is likely to prove useful in enhancing enzyme 
performance in 'non-naturaF environments (Arnold, 1993) as 
well as for obtaining new features never required by nature, 
provided an efficient selection or screening method can be 
found to channel the enzyme's evolution towards the desired 
properties. A significant advantage of this approach over 
'rational' design methods is that neither structural information 
nor a mechanistic road-map are required to guide the directed 
evolution experiment 

In addition to creating new enzymes for applications in 
biotechnology, directed evolution methods can be used to 
explore the limits of protein function. Although naturally 
occurring enzymes have evolved within the context of living 
organisms, there is no reason to believe that they are limited 
to the relatively narrow set of conditions that will support life. 
Water- mi scible organic solvents such as dUmethylfonnamide 
(DMF), for example, are highly toxic to most organisms, even 
at levels of <10%. Although enzymes can often tolerate much 
higher concentrations and still retain their folded structures, 
catalytic activity is often significantly compromised. To probe 
whether enzymes could be 'tuned' to tolerate and even thrive 
in an apparently hostile environment (high concentrations of 
DMF), we applied a directed evolution strategy involving 
sequential generations of polymerase chain reaction (PCR) 
random mutagenesis and screening to the serine protease 
subtilisin E (Chen and Arnold, 1993). Mutations that improved 
the specific activity of subtilisin in aqueous DMF were not 
rare; mutations conferring -2-fold enhancements in specific 
proteolytic activity could be found by screening only a few 
hundred colonies from each generation (Chen and Arnold, 
1993). The accumulation of effective mutations in sequential 
rounds of random mutagenesis led to a significant enhancement 
of the enzyme's activity: the specific activity of the 'evolved* 
enzyme containing 10 amino acid substitutions (10M subtilisin 
E) was enhanced 157-fold in 60% DMF, recovering much of 
the activity lost by addition of the organic solvent. 

Subtilisin is a useful catalyst for organic synthesis, particu- 
larly in the presence of organic solvents. Subtilisin can catalyze 
regioselective (Riva and Klibanov, 1988; Riva etal y 1988) 
and stereoselective (Margolin et al, 1991) acylations in organic 
media. The enzyme also catalyzes peptide synthesis, either by 
direct reversal of the hydrolytic process or by aminolysis of 
N-protected amino acid or peptide esters (Wong and Wang, 
1991). Obtaining high enzyme activity is important for syn- 
thetic applications such as these. The specific activity of 
subtilisin E was greatly enhanced by the directed evolution 
experiment described above. However, the total activity of the 
Bacillus subtilis culture supernatant was reduced to <10% 
that of the starting wild type culture supernatant in the absence 
of DMF (Table I). This drop in total activity reflects a large 
decrease in the expression of the 'evolved' 10M enzyme in 
the B.subtilis host. Such changes in expression levels are not 
unexpected, since DNA base and amino acid substitutions may 
affect the transcription, translation or enzyme export and 
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maturation process. This result can be directly attributed to 
the use of a screening method that focused only on improving 
specific activity in aqueous DMF. 

While mutations which slightly improved specific subtilisin 
activity in DMF could be found by screening only a few 
hundred colonies on average, fine-tuning a combination of 
expression levels and specific activity could be expected to 
require a search of larger variant libraries. Small libraries of 
subtilisin variants in B.subtilis can be obtained by direct 
transformation of the ligation products of the PCR random 
mutagenesis (Chen and Arnold, 1993). The low efficiency of 
B.subtilis transformation with ligated recombinant plasmids, 
however, limits the size of the mutant library that can be 
established in this host Here we report further directed 
evolution of subtilisin, using a B.subtilis-Escherichia coli 
shuttle vector to facilitate establishment of the mutant library 
in B.subtilis. By slightly altering the screening method to 
reflect the total subtilisin activity as well as activity in 
aqueous DMF, significant improvements in both measures 
were obtained. 



Materials and methods 

Restriction enzymes were purchased from Boehringer 
Mannheim (Indianapolis, IN). Reagents for oligonucleotide 
synthesis were purchased from Pharmacia (Piscataway, NJ). 
Succmyl-Ala-AJa-Pro-Phe-p-nitroaniUde (s-AAPF-pNa) was 
obtained from Sigma (St Louis, MO). Nitrocellulose mem- 
branes were purchased from Schleicher & Schuell (Keene, 
NH). Biodyne membranes were obtained from Pall (Glen 
Cove, NY). 

Bacterial strains and plasmids 

Bacillus subtilis strain DB428 and Bacillus cloning vector 
pKWZ containing the subtilisin E gene (Park etal., 1989) 
were kindly provided by R.Doi, University of California, 
Davis. Plasmid pUC19 and E.coli strain ER 1648 were from 
New England Biolabs (Beverly, MA). Plasmid pUBHO was 
from Bacillus Genetics Stock Center, Department of Bio- 
chemistry, The Ohio State University. Phagemids SKQ+ 
and SKII- were purchased from Stratagene (La Jolla, CA). 
Competent E.coli ER1648 was prepared according to the 
standard Ca 2+ protocol (Sambrook etal. t 1989). Bacillus 
subtilis DB428 was made competent for transformation as 
described previously (Chen and Arnold, 1993). Escherichia 
coli ER1648 cells containing plasmid pBE2 were grown on 
LB agar plates or LB broth supplemented with 20 u.g/ml 
kanamycin. Bacillus subtilis DB428 cells harboring plasmids 
pKWZ or pBE2 were grown on LB agar plates or LB broth 
supplemented with 50 u,g/ml kanamycin. 



Table I. TotaJ activities of supcrnatants of B.subiilis expressing wild type 


and variant subtilisins E 


Subtilisin £ 


0% DMF (units/ml) 20% DMF (units/ml) 


WT 


4.5 0.2 


I0M 


0.27 0.2! 


I2M 


1.4 \SS 


13M 


2.7 3.4 



Assay is for hydrolysis of s-AAPF-pNA in 0.1 mM Tris-HCI, 10 mM 
CaCI 2 . pH 8.0 with and without 20% DMF (v/v). 



Construction of B.subtilis-E.coli shuttle expression vector 
pBEl 

Shuttle vector pBEl was constructed by ligation of three 
DNA fragments. The first was the SnaBl-BamHl fragment 
(nucleotides 2164-793) of plasmid pUBllO containing the 
Bacillus replication origin region and the kanamycin resistance 
gene (McKenzie etal, 1986, 1987). The second and third 
fragments were generated from pUB 1 1 0 and pUC 1 9 by the PCR 
(Saiki etal, 1988) using synthetic oligonucleotide primers. 
To obtain the second fragment, oligonucleotides M05B (5' 
CTCTTCCITC QGAECCTATG TAAATCGCTC C TT 3') a nd 
M03P (5' CCTCCTTTCC CAQCDjGAAC GAGACTTTGC 
AGT 3') were used as the 5' and 3' primers, respectively, to 
amplify the region from nucleotides 1075 to 1651 of pUBllO 
containing the minus-strand replication origin. The 3' primer 
was designed to eliminate the unique Ndel site of pUBllO to 
facilitate the subsequent cloning of the subtilisin E gene for 
random mutagenesis. The first 10 nucleotides at the 5' ends 
of both primers were random sequences to facilitate restriction 
enzyme digestion. The underlined sequences were the restric- 
tion sites for BamHl and PvuU respectively. The third fragment 
containing the replication origin of E.coli plasmid pUC19 
(nucleotides 630-1559) was generated using the 5' and 3' 
primers C0E5H (5' TCTTCAGCCTGHGACCTGC ATTAA- 
TGAATCGG 3') and COE3H (5' TTCTCITCCTGIIGACT- 
AAAAAGAAGCAGGTTTTCTrATACCTGCTTCTTTT. 
T AC AGCTG A TCTAGGTG AAG ATCC 3'). respectively. The 
first 10 nucleotides of COE5H and COE3H are random 
sequences. The underlined sequences are restriction sites for 
HincU and PvuU respectively. Primer COE3H contains the 
transcriptional terminator sequence of the subtilisin E gene 
(shown in bold face), which was designed to be positioned 
immediately downstream from the kanamycin gene in pBEl. 
The PvuU site in primer COE3H was used to deterrnine the 
orientation of the third fragment in pBEl and to facilitate future 
plasmid manipulation. The subtilisin E gene was removed 
from plasmid pKWZ by cutting with EcoRl and BamHl and 
subcloned into EcoRI-£fcimHI-digested pBEl to form pBE2 
(Figure 1). 

PCR-based random mutagenesis 

Random mutagenesis was carried out using error-prone PCR 
(Leung etal y 1989). Two synthetic oligonucleotides SUB5N 
(5' GATCCGAGCGTTGi^IAEQTGGAAGAAGATCAT 3') 
and SUB3B (5' GGTTCI TGGATCC GATT-CAA CATGCGG- 
AG 3') permitted amplification of the full-length subtilisin E 
sequence. The underlined sequences are Ndel and BamHl 
restriction sites for SUB5N and SUB3B respectively, which 
allowed the PCR products to be ligated with vector pBE2 
digested with the same enzymes. The ligation mixture was then 
transformed into E.coli ER1648 competent cells to generate a 
library of subtilisin E mutant genes. The mutant library was 
isolated from E.coli ER1648 host cells and transferred into 
B.subtilis DB428 competent cells for expression and screening. 
Screening for enhanced subtilisin activity in aqueous DMF 
Modified Schaefer's agar plates (Leighton and Doi, 1971) 
containing 1% casein were used for B.subtilis growth and 
subtilisin expression. Agar plates were first covered with 
two membranes, one organic solvent-resistant Biodyne nylon 
membrane and one nitrocellulose membrane, onto which 
B.subtilis trans form ants were plated. Bacillus subtilis cells 
containing an unmutated subtilisin E gene transformed with 
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pBE2 were used as controls. After overnight incubation at 
37°C, observable halos indicated that colonies were secreting 
active subtilisin. The top membrane carrying the B.subtilis 
colonies was removed and kept at 4°C. The bottom membrane 
(which contains enzyme at each spot corresponding to indi- 
vidual halos) was then transferred onto a new set of agar 
plates containing 1% casein and 35-45% DMF. These DMF 
plates were incubated at 37°C overnight. The halo sizes on 
both the aqueous and DMF plates were then evaluated and 
compared with those of unmutated control colonies. Colonies 
which outperformed the controls were selected and grown in 
2 ml modified Schaeffer's medium at 37°C for 48 h in a rotary 
shaker. The supernatants of the liquid cultures were used for 
assays of total hydrolytic activity towards the peptide substrate 
s-AAPF-pNA in 0.1 mM Tris-HCl, pH 8.0 and 10 mM CaCl 2 . 
DMF was added as indicated (volume %). 

Sequencing of subtilisin E variant genes 
Selected plasmids were digested with EcoRl and BamHl and 
the subtilisin E gene subcloned into Bluescript SKH+ and 
SKU- phagemids with EcoKL and BamHl. Single-stranded 
phagemids containing both orientations of a variant subtilisin 




Fig. 1. The B.subtilis-E.coli shuttle vector pBE2 carrying a subtilisin E 
gene, constructed from pUBHO and pUCI9 (see Male rials and methods). 
The Nde\ site of pUBI 10 downstream from the minus origin (MO) region 
was eliminated so that the Nde\ site in the subtilisin E gene can be used for 
generating the PC R- based mutant library. 



E gene were prepared according to the supplier's instructions. 
Dideoxynucleotide DNA sequencing was carried out using the 
Sequenase II kit (USB). 
Enzyme purification and kinetics 

Subtilisin E purification and kinetic studies were carried out 
as described previously (Chen and Arnold, 1993). s-AAPF- 
pNa (0.24 mM) was used to assay hydrolytic activity in 0.1 M 
Tris-HCl, pH 8.0, 10 mM CaCl 2 , 37°C, with DMF added as 
indicated. With the exception of the total activity assays, which 
were carried out using culture supernatant, all activity assays 
were carried out with purified enzymes. One unit of subtilisin 
E is the amount of enzyme required to catalyze the hydrolysis 
of 1 nmol substrate to products in 1 min. 

Molecular modeling 

Modeling of subtilisin E and its variants was carried out using 
the InsightH computer program based on the X-ray crystal 
structure of the subtilisin from Bacillus mesentericus (Dauter 
et ai t 1991) (Protein Data Bank reference: 1MEE). Subtilisin 
E from B.subtilis differs from subtilisin from B. mesentericus 
only at three out of 275 amino acid positions. The energy 
minimized s-AAPF-pNA-subtilisin complex is in good agree- 
ment with a similar model prepared for subtilisin BPN' and 
s-AAPF-pNa (Wells and EsteU, 1988). 

Results and discussion 

The B.subtilis-E.coli shuttle expression vector pBE2 for 
directed evolution 

Successful application of directed enzyme evolution requires 
screening large numbers of variants for the desired features). 
Provided the screen is sufficiently sensitive, mutations confer- 
ring small enhancements can be identified and subsequently 
combined to achieve the desired result. The size of the mutant 
library that can be established in E.coli is generally not a 
limiting factor for screening, since even the most rapid screen- 
ing methods can only handle -lO 6 or fewer clones. The direct 
cloning of mutant subtilisin E genes into B.subtilis competent 
cells, however, occurs with low efficiency (a few hundred 
transforrnants per microgram of DNA) relative to E.coli (10 6 - 
10 7 transform an ts/(Jlg DNA). Screening in B.subtilis is therefore 
quickly limited by the size of the mutant library. This is at 
least partially due to the fact that only multimeric linear or 
circular forms of plasmids can efficiently transform B.subtilis 
(Canosi etal, 1978; Mottes etal. t 1979; de Vos etal, 1981). 
The monomelic ligation products from the linearized plasmid 
and target gene cannot transform B.subtilis competent cells 
(de Vos etai f 1981). 

The B.subtilis-E.coli shuttle vector pBEl was designed in 
order to facilitate the establishment of larger mutant libraries 
in B.subtilis. The shuttle vector was constructed by ligating 



Table 11. Kinetic parameters k^, K m and k^K^ for hydrolysis of S-AAPF-pNn 
containing 0, 20 and 60% (v/v) DMF at 37°C 


by subtilisin E variants in 0.1 M Tris-HCl, 10 mM CaCl 2f pH 8.0, 


Variant 


0% DMF 






20% DMF 






60% DMF 


K m (mM) 




(M-'s^XlO- 3 ) 


K m (mM) 


*« (s-») 


k^JK 

(M-'s-'xiO- 3 ) 


(M-^XIO- 3 ) 


WT 


0.56 


21 


38 


17 


12 


1.4 


0.014 (])■ 


I0M 


0.1 


27 


270 


0.7 


73 


99 


2.2 (157) 


13M 


0.067 


39 


582 


0.4 


98 


245 


6.6 (471) 



"Catalytic efficiency relative to wild type subtilisin E. 
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Fig. 2. Comparison of the catalytic efficiencies of wild type and I3M 
subtilisin E towards hydrolysis of s-AAPF-pNa in 0.1 M Tris-HCl, pH 8.0, 
10 mM CaCI 2 with DMF (v/v), at 37X. 

three DNA fragments from plasmids pUBllO and pUC19 
containing the replication origin region and the kanamycin 
resistance gene of pUBllO, the portion of pUBllO covering 
the minus-strand replication origin and the replication origin 
region of pUC19 (see Materials and methods). The subtilisin 
E gene was subcloned from pi as mid pKWZ to form pBE2 
(Figure 1). The stabilities of plasmids pBEl and pBE2 were 
found to be similar to that of pUBllO in B.subtilis (data 
not shown). 

With the aid of shuttle vector pBE2, mutant libraries 
established first in E.coli could be easily amplified, isolated 
by a routine miniprep method and transferred into frozen 
B.subtilis competent cells. A significant fraction of plasmids 
isolated from E.coli ER 1648 is in the form of a supercoiled 
dimer of multimers (data not shown), which transform B.sub- 
tilis competent cells with high efficiency (de Vos et ai, 1981). 

Screening for increased subtilisin activity in aqueous DMF 

Sequential rounds of random mutagenesis and screening to 
identify and accumulate positive mutations resulted in a 
subtilisin E variant (10M) with significantly enhanced specific 
activity towards peptide hydrolysis in aqueous DMF (Chen 
and Arnold, 1993). The k<JK m of the 10M variant in 60% 
DMF, for example, is enhanced 157 -fold relative to the 
wild type enzyme for hydrolysis of s-AAPF-pNa (Table II). 
However, as shown in Table I, the total hydrolytic activity of 
the B.subtilis culture supernatant was reduced to less than 10% 
that of the starting wildtype culture, reflecting a very significant 
drop in the overall expression level. This is consistent with 
our observation that B.subtilis colonies expressing the 10M 
mutant generate much smaller halos on 1% casein plates 
containing no DMF than colonies expressing the wild type 
enzyme. 

Starting with the gene for the 10M subtilisin E, two 
additional generations of PCR-based random mutagenesis and 
screening were conducted, using slightly modified screening 
criteria. After the first round of mutagenesis, one colony with 
a larger halo than the control colony (cells containing the 
parent 10M subtilisin E gene) on both casein plates containing 
DMF and no DMF was selected from several thousand colonies 
screened. That gene was isolated and subjected to a second 
round of random mutagenesis. A single colony producing 
larger halos on both aqueous and DMF plates was similarly 



Table 111. Residual activities (reported as % of starting activity) of wild 
type, 10M and 13M subtilisins E after incubation in 70% DMF at 25°C 



Subtilisin E Incubation time 





60h 


130 h 


460 h 


Wild type 


91 


88 


48 


10M 


88 


85 


42 


13M 


92 


89 


50 



Table IV. DNA and amino acid substitutions in 1 3M subtilisin E, obtained 
by site-directed mutagenesis and random mutagenesis and screening for 
enhanced specific activity in aqueous DMF 



Base 


Base 


Position in 


Amino acid 


Amino acid 




substitution 


codon 




substitution 


595 


A->C 


3 
1 


42 


Silent 


596 


G-»A 




597 


C— Kj 


2 


48 


Ala— > Am 


632" 


G->A 


1 


60 


Asp— *Asn 


643 


T->C 


3 


63 


Silent 


664 


T->G 


3 


70 


Silent 


706 
744" 


T->C 


3 


84 


Silent 


A-K3 


2 


97 


Asp->Gly 


762" 


A->G 


2 


103 


G)n->Arg 


773 


A-»G 


1 


107 


IIe->Val 


838 


A->G 


3 


128 


Silent 


846 


G->A 


2 


131 


G)y-*Asp 


871 


T-+C 


3 


139 


Silent 


921 


A-*G 


2 


156 


Glu->Cly 


946 


A-^G 


3 


164 


Silent 


996 


A->G 


2 


181 


Asn->Ser 


998 . 


A-»G 


1 


182 


Ser-K3ly 


1016 


T-+C 


1 


188 


Ser->Pro 


1033 


T-*C 


3 


193 


Silent 


1071 


A-»T 


2 


206 


Gln-»Leu 


1107' 


A->G 


2 


218 


Asn-»Ser 


1126 


T->C 


3 


224 


Silent 


1141 


A->G 


3 


229 


Silent 


1153 


A-K3 


3 


233 


Silent 


1207 


A-vG 


3 


251 


Silent 


1216 


A->G 


3 


254 


Silent 


1217 


A-^G 


1 


255 


Thr->Ala 



DNA and amino acid substitutions resulting from two generations of 
random mutagenesis and screening of 10M subtil iisin E (Chen and Arnold. 
1993) for enhanced total and specific activity in aqueous DMF are 
underlined. 

"Substitutions incorporated by site-directed mutagenesis into DNA coding 
for 4M subtilisin E used as template for sequential random mutagenesis. 

identified from that B.subtilis library. The enzymes isolated 
from these first- and second-generation colonies are termed 
12M and 13M, respectively. 

Catalytic activities of the 'evolved' enzymes 
The total activities of supernatants of B.subtilis cultures 
expressing wild type, 10M and the further-evolved 12M and 
13M subtilisin E genes towards hydrolysis of s-AAPF-pNa 
are summarized in Table I. The 12M variant from the first 
round of random mutagenesis showed significantly greater 
activity with and without DMF, relative to its parent 10M. 
The total activity is in fact slightly greater in the presence of 
20% DMF than in purely aqueous buffer. The 13M variant 
from the second generation continues this trend: the total 
activity in the absence of DMF is 10- fold greater than the 
starting 10M variant, while activity in 20% DMF is improved 
by a factor of 16. In contrast to wild type subtilisin E, which 
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Fig. 3. Model of subtiiisin E showing the 10 amino acid mutations in I0M (yellow) and the additional three mutations identified in the 13M variant (blue) 
evolved for increased total activity in DMF. Bound Ca 2+ and peptide substrate s-AAPF-pNA are shown in gray. 



retains only 4.4% of its activity in the presence of 20% DMF, 
13M subtiiisin E is actually 25% more active in 20% DMF 
than with no DMF at all. 

The subtiiisin variants were purified and further character- 
ized. The kinetic parameters for the hydrolysis of s-AAPF- 
pNa by wild type subtiiisin £ and the selected variants are 
summarized in Table n. The specific catalytic efficiency 
kcJKm of 13M subtiiisin E is approximately 2.4-fold higher 
than that; of 10M in 20% DMF and 175 times greater than 
that of the wild type enzyme. The increase in catalytic 
efficiency reflects an increase in as well as a decrease in 
the K m (Table II). 

As observed previously for the 10M variant, the enhancement 
in the catalytic efficiency of 13M subtihsin E extends to very 
high concentrations of DMF (figure 2). The catalytic efficiency 
of 13M subtiiisin is less sensitive to DMF than, the wild type 
enzymt over the entire range of DMF concentration. In 60% 
DMF, the k^K m for 13M is 3-fold higher than 10M and 471- 
fold higher than the wild type subtiiisin E. 

The loss of activity that the wild type enzyme experiences 
in DMF has been substantially recovered by sequential genera- 
tions of random mutagenesis and screening, to the point that 
the evolved enzyme is actually better at this particular catalytic 
task in low DMF concentrations (<40%) than the wild type 
enzyme in purely aqueous media. In 20% DMF, 13M subtiiisin 



E is -6.5 times more efficient than the wild type enzyme in 
aqueous solution (Figure 2). In 40% DMF, it is still 1.5 times 
more efficient than is wild type subtiiisin E in the absence of 
DMF. This result is not as surprising as it might seem at first 
glance. It should be noted that in aqueous media the evolved 
enzymes are all 5- to 10-fold more efficient than the wild type 
enzyme. This improvement in the KJK m is largely due to a 
decrease in the K m for the specific substrate s-AAPF-pNa used 
for screening (Chen and Arnold, 1993). Because the enzyme's 
evolution has been directed towards improving activity towards 
this particular substrate as well as catalytic activity in DMF, 
it is not unreasonable that the evolved enzyme is even more 
efficient than wild type. 

Enzyme stability 

The time courses of deactivation of the wild type, 10M and 
13M subtilisins were compared at 25°C in 70% DMF (Table 
HI). Under these conditions the evolved subtiiisin will catalyze 
the polymerization of amino acid esters (Chen and Arnold, 
1993). After 460 h incubation, mutant 13M retained -50% 
activity, essentially the same as the wild type enzyme. As 
observed previously for the directed evolution of the 10M 
parent enzyme (Chen and Arnold, 1993), the mutations that 
enhance catalytic efficiency and total expressed activity have 
no significant net effect on the enzyme's stability, as measured 
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by residual activity after incubation in aqueous DMF. Since 
the enzyme is sufficiently stable to maintain its activity during 
the screening process, in the presence of DMF, obviously no 
benefit is derived from mutations that only enhance stability. 
The fact that mutations enhancing catalytic efficiency occur 
without corresponding changes in stability indicate that activity 
and stability in DMF are not coupled traits. 

DNA and amino acid sequence changes 
Site-directed mutagenesis followed by five generations of 
sequential random mutagenesis and screening resulted in the 
accumulation of a total of 27 DNA base substitutions coding 
for 13 amino acid substitutions in 13M subtilisin E (Table IV). 
Two generations of random mutagenesis and screening resulted 
in seven new base changes (underlined in Table TV) distributed 
across the 800 bp portion of the 10M subtitisin E gene sequence 
subjected to random mutagenesis. The PCR mutagenesis 
method used previously to obtain 10M subtilisin E involved 
introduction of dGTP and dATP in a 10:1 ratio and resulted 
in a preponderance of transitions (A— >G or T-*C). A more 
balanced distribution of transversions (three) and transitions 
(four) was obtained using the method of Leung tt al (1989). 
These seven base changes give rise to three amino acid 
substitutions, two of which (A48R and 1 107V) appeared during 
the first cycle of mutagenesis and screening (12M subtilisin 
E) and a third (Q206L) in the second cycle (1 3M). Substitutions 
1 107V and Q206L are the results of single base substitutions 
in their corresponding codons. Substitution of Ala at position 



48 by Arg, on the other hand, is the result of a double base 
change in the codon for amino acid 48. 

As shown in Figure 3, the 10 amino acid substitutions in 
the parent 10M enzyme (shown in yellow) are clustered on 
the face of enzyme that harbors the active site and substrate 
binding pocket. The three new amino acid substitutions found 
in 13M subtilisin E (shown in blue) also appear more or less 
on the same face. All three new substitutions reside on 
the enzyme surface, as do all the previous 10 amino acid 
substitutions. Figure 3 shows that the 10 amino acid substitu- 
tions in 10M subtilisin E are located in loop structures 
connecting elements of secondary structure. In contrast, the 
three new amino acid substitutions in 13M are located in 
a-helix (1107 V) or P-sheet (A48R and Q206L) secondary 
structures. 1 107V lies in the substrate binding pocket, where 
it interacts directly with the peptide substrate (shown in gray), 
while the remaining two mutations are located well outside 
this area. The substitution of He 107 with Val was found in an 
earlier study to increase the stability slightly of subtilisin from 
Bacillus amyloliquefaciens towards autolysis under alkaline 
conditions (Cunningham and Wells, 1987). 

Comparison of the amino acid sequences of more than 40 
subtilisins from different sources shows a conserved core with 
insertions and deletions that are preferentially confined to 
surface loops (Siezen etal., 1991). As indicated in Figure 4, 
the core a-helix and P-sheet secondary structures are relatively 
conserved, while the sequences of the peptide loops that 
connect these secondary structure elements are variable among 
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the different subtilisins (VRs). Figure 4 clearly shows the 
positions of the 10 amino acid substitutions in 10M subtilisin 
E in these variable loops on the surface of the enzyme 
surrounding the active site. In contrast, the three new amino 
acid substitutions in 13M lie within secondary structures that 
are more conserved throughout the different subtilisins. 

Although these secondary structures are more conserved in 
their sequences, variations do occur among subtilisins from 
different sources. Using the subtilisin sequences compared by 
Siezen etal (1991), it can be seen that all three of the new 
amino acids in 13M are found in subtilisins from cyanobacteria 
(Arg48) and Gram-positive (Vall07 and Leu206) or Gram- 
negative bacteria (Leu206). One subtilisin (from Staphylo- 
coccus epidermis) contains two of the three new amino acids 
in 13M (Vail 07 and Leu206); none of the sequences compared, 
however, have all three. Thus the individual 'solutions' found 
by this directed evolution approach have indeed been explored 
by nature, albeit for other purposes. As noted previously (Chen 
and Arnold, 1993), at least seven of the 10 amino acid 
substitutions in 10M subtilisin E are found in subtilisins from 
other sources. The new 13M subtilisin variant can therefore 
be thought of as largely a novel 'mix and match* of other, 
naturally occurring enzymes. 

It is unknown how the mutations in the subtilisin gene 
reduce the expression of active enzyme in the 10M variant or 
why expression has been recovered in the 13M variant 
Mutations at the DNA level can affect the transcription and 
translation processes, while mutations at the protein level can 
affect the folding and processing of the enzyme. For example, 
the subtilisin maturation process requires a pro-sequence which 
acts as an intramolecular chaperon for the correct folding of 
the enzyme (Ikemura et a/., 1987). Changes in local secondary 
or tertiary structures resulting from amino acid substitutions 
could interfere with the maturation process and reduce enzyme 
yield. The enzyme yield was (purposefully) not maintained 
during the earlier directed evolution for enhanced specific 
activity in DMF. The amino acid * solutions' to this problem 
found in 10M subtilisin E all share the following features: 
they are located on the surface of the enzyme around the 
active site (although not necessarily adjacent) and they are all 
located in sequence- variable loops. When the screening criteria 
were modified in the current experiment to include an enhanced 
enzyme yield, the nature of the 'solutions' changed. That the 
three new amino acid substitutions in conserved secondary 
structures in 13M contrast with the positions of the 10 previous 
mutations in variable loops (Figure 4) hints at a mechanism 
for recovering total activity that occurs at the protein level. 

A basic rule of directed evolution (and any random mutagen- 
esis experiment) is, 4 you get what you screen for'. The screen 
(or selection) should therefore reflect the desired result as 
closely as possible. In particular, the standards for selecting a 
positive should include all features of interest, in order to 
rmnimize unwanted side-effects such as a lower expression 
level or changes in other features markedly affected by 
mutations at the DNA or amino acid levels. These realities 
often work against the use of a selection scheme that offers 
the ability to search larger fractions of sequence space for 
something no-one wants. 

Although screening limits the search for beneficial mutations 
to a very small fraction of sequence space, a workable strategy 
for directed evolution is to accumulate mutations conferring 
small enhancements in order to achieve the desired property 
(Arnold, in press). Sequential generations of error-prone PCR 



mutagenesis and screening is one effective method by which 
this can be accomplished. The accumulation of amino acid 
substitutions in an enzyme can result in significant improve- 
ments in performance, particularly for features not optimized 
under selective pressure. 
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Cao, H. and Shannon, J. C. 1996. BTl, a protein critical for in vivo starch accumula- 
tion in maize endosperm, is not detected in maize endosperm suspension cultures. - 
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Maize Btl is a structural gene for the major 39-44-kDa amyloplast membrane 
polypeptides. Loss of BTl in the brittle 1 (btl) mutant results in an 80% reduction in 
kernel starch. As an initial step toward the understanding of BTl function, we investi- 
gated the relation of BTl to starch accumulation in the two well -characterized maize 
endosperm suspension-cultured cell lines which were derived from 10 days post polli- 
nation (DPP) kernels of inbred A636 and 12-DPP kernels of the waxy mutant in the 
A636 background. Starch in A636 endosperm cultures accounted for about 1.5% of 
the fresh weight of cells but BTl was not detectable in amyloplast membranes or in 
microsomal membranes isolated from the cultured cells. Btl transcripts were detected 
in 10- and 20- DPP A636 kernels, but only a trace of Btl transcripts was detected in 
the suspension-cultured cells. Southern blotting indicated that the Btl gene was 
present in the genome of the cultured cells. The Btl gene products in A636 endosperm 
cultures were not increased by treatments which enhance starch accumulation such as 
sucrose supplementation, chlorocholine chloride (CCC) addition to the medium, or 
both. A similar pattern of differential Btl gene expression was found in the waxy en- 
dosperm cultures. These results indicate that starch accumulation in the cultured cells 
is not correlated with Btl gene expression. The significance of these findings is dis- 
cussed. 
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Miction 

pplastids are a group of subcellular organelles includ- 
|hlorop lasts, chromoplasts, amyloplasts, etioplasts, 
plasts and proplastids. Plastids contain a double- 
fTfirane envelope, stroma, and often internal mem- 
fe.as well as one or more starch granules (Boyer et al. 
ffi The envelope membrane proteins are essential in 
Mfting and translocation of nuclear-encoded proteins 
led to plastids (Keegstra et al. 1989, Schnell et al. 
P/and in die transport of various metabolites (Heldt et 
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al. 1991). At present, the transport processes and starch 
biosynthetic pathway(s) in nonphotosynthetic tissues are 
controversial (Boyer et al. 1989, Okita 1992, Villand and 
Kleczkowski 1994). The use of maize mutants in the 
study of starch biosynthesis has contributed much to an 
improved understanding of the in vivo mechanism of 
starch synthesis (Nelson and Pan 1995). The study of mu- 
tants including shrunken! {sh2), brittlel (bt2), amylose 
extender (ae) and waxy (wx) has helped to establish some 
of the critical steps in starch synthesis (Hannah et al. 
1993, Nelson and Pan 1995, Preiss 1991). 
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Kernels of the maize starch-deficient mutant brittle! 
(btl), first reported in 1926 (Mangelsdorf 1926, Wentz 
1926), accumulate about 20% as much starch as kernels 
of the normal genotype and sucrose content is increased 
by more than 500% (Tobias et al. 1992). Several lines of 
evidence support the conclusion that Brittle 1 protein 
(BT1) is an amyloplast membrane polypeptide and pos- 
sibly an adenylate translocator. First, the protein se- 
quence deduced from a Btl cDNA has a transit peptide 
at its amino terminus and a sequence similarity to sev- 
eral mitochondrial inner-envelope translocator proteins 
(Sullivan et al. 1991). Second, Li et al. (1992) showed 
that in vitro synthesized BT1 is imported into chloro- 
plast, processed into mature protein, and localized to the 
inner envelope membranes. Third, BT1 was shown to be 
localized to maize amyloplast envelope membranes by 
immunocytolocalization (Sullivan and Kaneko 1995) 
and by immunoblotting of polypeptides from isolated 
amyloplast membranes (Cao et al. 1995). BT1 is a clus- 
ter of four 39- to 44-kDa amyloplast-specific and inte- 
gral membrane polypeptides and in normal kernels diese 
polypeptides account for about 40% of the total amylo- 
plast membrane protein (Cao et al. 1995). Fourth, devel- 
oping btl kernels accumulated over 13-fold higher lev- 
els of adenosine 5' diphosphate glucose (ADP-Glc) than 
normal (Shannon et al. 1996). Finally, amyloplasts iso- 
lated from young kernels of the btl mutant were only 
25% as active in the uptake of ADP-Glc and conversion 
to starch as amyloplasts from normal and several other 
maize endosperm mutants (Liu et al. 1992). 

For several decades, in vitro cultures of maize en- 
dosperm cells grown in chemically defined media have 
been suggested as model systems for in vivo studies of 
starch biosynthesis (Chu and Shannon 1975, Shannon 
1982). The endosperm culture system has been charac- 
terized for cell growth (Chu and Shannon 1975), ceil fri- 
ability and ultrastructure (Felker 1987, Felker et al. 
1989), sugar uptake (Felker and Goodwin 1988), and 
starch characteristics (Saravitz and Boyer 1987). A tran- 
sient accumulation and mobilization of starch occur dur- 
ing a normal 7- or 14-day culture cycle. The cultured 
cells accumulate starch to about 5-10% of their dry 
weight at mid culture cycle and then it declines to a low 
level prior to subculture (Chu and Shannon 1975). Chu 
and Shannon (1975) also reported that starch content 
was increased up to 12 to 14% of the dry weight by sup- 
plementation of the cultures several days after subcul- 
ture with additional sucrose. Thus, clearly these cultured 
maize endosperm cells are capable of starch biosynthesis 
and we were therefore interested in using them to study 
BT1 function. As an initial step, we investigated the re- 
lation of BT1 to starch accumulation in the two well- 
characterized maize endosperm suspension-cultured cell 
lines initially derived from 10 days post pollination 
(DPP) endosperm of the normal A636 inbred, and from 
the endosperm of 12- DPP wx mutant kernels. Both sus- 
pension-cultured cell lines, A636 and wx, contained the 
Btl gene (as detected by Southern blotting), but only 
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traces of Btl transcripts were detected by northern 
ysis. The presence of BT1 polypeptide was belowfl 
level of detection by immunoblot assay. Our results inf 
cate that starch accumulation in the cultured cells istv 
correlated with the presence of BT1 in the amylo|f 
membrane. 



Abbreviations - ADP-Glc, ADP-glucose; AGPasc, ADpJ? 
pyrophosphorylase; ae, amylose extender, btl, brittle] ;L 
Brittle] gene; BT1, Brittle 1 protein or Brittlel po]ypepiid| 
CCC, chlorocholine chloride; DPP, days post pollination; Pljjf 
phenylmethylsulfonyl fluoride; wx, waxy. 



Materials and methods 
Plants, chemicals and manipulation 

Maize (Zea mays L. inbred A636 and the endosp 
mutant, waxy, in a near isogenic A636 backgrou|] 
plants were grown in a greenhouse as described preT 
ously (Tobias et al. 1992). The plants were self- orf 
pollinated. Ears were quick-frozen in liquid mtrogerl 
stored at -70°C until used, except for fresh earsf 
were used to isolate amyloplasts and amyloplast mil 
branes. The common reagents were purchased frorn§ 
ther Sigma (St Louis, MO, USA) or Fisher Scientf 
(Pittsburgh, PA, USA). Special reagents and kits ar§j 
dicated in the text. General molecular biological ft 
niques followed the standard procedures (Ausubel % 
1993, Sambrook et al. 1989). Manipulations of m| 
branes and proteins were the same as those previojf 
described (Cao et al. 1 995). 

Maize endosperm suspension cultures 

Maize endosperm suspension cultures were derived fr| 
10-DPP endosperm of inbred A636 and 12-DPP|i 
dosperm of the wx mutant in a near isogenic A636 b|| 
ground according to the procedure of Shannon (19| 
Prior to this study, the endosperm suspension cultun 
A636 and wx had been in culture for about 7 and 3 y| 
respectively. The suspension cultures were routin 
grown in 250-ml Erlenmeyer flasks containing 80 mj| 
medium devoid of added growth regulators according 
the protocol of Shannon (1982). To enhance starch acg 
mulation in some experiments, cells were subcultug 
into the standard medium (80 ml) supplemented withM 
ml of a filter-sterilized chlorocholine chloride (Cg 
stock to yield 90 ml of medium containing 100 uM f| 
or cells subcultured to the standard medium were suf||| 
mented 3 days after subculture with 10 ml of a steril||j 
lution containing 8 g of sucrose (Chu and Shan 
1975). In the combination treatment, cells subcultuj 
into the CCC-containing medium were also su|)||j 
mented with sucrose as above (final volume = lOO-^I 
All cultures were harvested 7 days after subculture^ 
days after sucrose supplementation) by vacuum fil^S 
and washed twice with distilled water. Fresh cells vp 
used to isolate amyloplast membranes. DNA, RNA,|| 
microsomal membranes were isolated from frozen cel^ 
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Action and determination of starch and structural 
§|acclia«*i des 

mi (2 g of fresh weight) were homogenized with a 
on (Brinkmann Instruments, Westbury, NY, USA) 
nes for 10 s each in 15 ml of ice-cold MCW (metha- 
||hloroform:water, 13:4:3, v/v/v). The homogenate 
fcentrifuged at 800 g for 10 min. The pellet was 
fed 3 times by suspension in 10 ml of MCW and 
Ifugation as above. The washed pellet was sus- 
||d in 15 ml of 10% ethanol, held overnight at 4°C 
_ hen centrifuged at 3 500 g for 20 min. The pellet 
ffeextracted 3 times by suspension in 10 ml of 10% 
j|p] and centrifuged at 3 500 g for 20 min. All ex- 
gbns were done in the cold (~4°C). The final pellet, 
frded as total 10% ethanol-insoluble polysaccha- 
Htotal polysaccharides), was suspended in deion- 
later by heating for 15 min in a boiling water bath. 
^henol-H 2 S0 4 procedure (Dubois et al. 1956) was 
Mb estimate total polysaccharide content. To deter- 
gistarch content, an aliquot of the suspended total 
ffkccharide fraction was digested overnight at 37 °C 
ml units ml" 1 of amyloglucosidase (Sigma) in 0.5 M 
^frn acetate, pH 4.8. The digest was heated in a boil- 
later bath for 1 min, centrifuged at 20000 g for 10 
land the quantity of glucose released determined by 
pi's reducing sugar test (Hodge and Hofreiter 
^Structural polysaccharide content was calculated 
% difference between total polysaccharides and 



gemination of a-amylase activity 

Intcells were homogenized with a Polytron 4 times 
ts each in 3 volumes of extraction buffer contain- 
M)0 mM potassium phosphate buffer (pH 7.0), 5 
^thylenediaminetetraacetic acid (EDTA), and 1 
jithioerythritol. Following filtration through one 
jK'f Miracloth and centrifugation at 3 500 g for 30 
|fhe supernatant was used to determine a-amylase 
Wiy by the starch azure assay (Doehlert and Duke 
Mi 



fiction of protoplasts 

K| isolating amyloplast membranes from maize en- 
perm suspension-cultured cells, protoplasts were 
fficed from the cultured ceils as follows. The suspen- 
fficuitured cells derived from A636 (100 g of fresh 
£it) were plasmolyzed in 2x digestion medium (lx 
|tion medium: 50 mM 2-[N-moipholino]ethane- 
|qnic acid [MES], pH 5.6, 1 mM KC1, 1 mM MgCl 2 , 1 
£||pTA, 0.5 M sorbitol) at 30°C for 30 min and then 
||ed in 2 ml of lx digestion medium containing 
ffi|ml _1 cellulase (Sigma, C-0901 from Penicillium 
gpojiim) and 10 U ml" 1 pectinase (Sigma, P-9932 
\M s P er giHus niger) per gram fresh weight of cells. 
j||tion of the cells was carried out in a 3.5-1 flat-bot- 

w 
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tomed glass vessel (20 cm in diameter). After 2 h at 
30° C with gentle agitation, release of protoplasts was 
complete and the vessel was inclined and protoplasts and 
debris allowed to settle at 1 g for 30 min. The superna- 
tant was carefully syphoned off and replaced with half 
original volume of digestion medium less enzymes. The 
resulting suspension was then filtered through Miracloth 
and spun at 14 g for 15 min. The supernatant was re- 
moved and protoplast resuspended in 1 ml of homogeni- 
zation buffer (50 mM N-[2-hydroxyethyl]piperazine-A^- 
[2-ethanesulfonic acid] [HEPES], pH7.5, 10 mM KC1, 1 
mM MgCl 2 , 1 mM EDTA, 1 mM dithiothreitol [DTT], 1 
mM phenylmethylsulfonyl fluoride [PMSF], and 0.5 M 
sorbitol) minus PMSF and DTT per gram of cells and 
centrifuged as above. The washed and pelleted proto- 
plasts were suspended in 2 volumes of homogenization 
buffer. 



Isolation of amyloplast membranes and marker enzyme 
analysts 

Amyloplast membranes from developing kernels were 
isolated as described previously (Cao et al. 1995). Amy- 
loplast membranes from A636 endosperm cultures were 
isolated from the purified protoplasts as follows. Proto- 
plasts were ruptured either by homogenization for 1 s in 
a VirTis 23 homogenizer (The VirTis Company, Gar- 
diner, NY, USA) operated at maximum power as de- 
scribed by Cao et al. (1995), or by two passages through 
a nylon mesh (10 um opening) positioned at the lower 
end of a 50-ml disposable plastic syringe as described by 
Shannon et al. (1987). The homogenate or lysate was 
passed through one layer of 10-um pore-sized nylon. 
The filtrate was centrifuged at 100 g for 15 min to pellet 
the amyloplasts. The amyloplasts were suspended in ho- 
mogenization buffer and purified through a Percoll den- 
sity gradient consisting of 20, 30, and 40% of Percoll in 
homogenization buffer minus PMSF and DTT. Amylo- 
plasts in the 20/30% and 30/40% interfaces and pellet 
layer were collected. The purity of amyloplasts was de- 
termined by marker enzyme analyses using the com- 
monly used marker enzymes, i.e. alcohol dehydrogenase 
(cytosol), cytochrome c oxidase (mitochondria), catalase 
(microbodies), cyamde-insensitive NADH-cytochrome 
c reductase (ER) as described previously (Cao et al. 
1995). Amyloplast membranes were isolated from the 
purified amyloplasts using the same procedure as that 
used for developing endosperm (Cao et al. 1995). 

Isolation of microsomal membranes 

The procedure for isolating microsomal membranes 
from developing kernels was described previously (Cao 
et al. 1995). A similar procedure was used to isolate mi- 
crosomal membranes from suspension-cultured en- 
dosperm cells. The microsomal membrane pellet was 
suspended in TES buffer (10 mM Tricine, pH 7.2, 1 mM 
EDTA, 0.2 M sucrose). 
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Protein determination, SDS-PAGE and western blotting 

Protein content was determined as described (Cao et al. 
1995) with the Bio-Rad protein assay (Bio-Rad Labora- 
tories, Richmond, CA, USA) and using bovine serum al- 
bumine (BSA) as a standard. Sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis (SDS-PAGE) fol- 
lowed that of Sambrook et aL (1989). The western blot- 
ting procedure used was detailed previously (Cao et al. 
1995). Briefly, after electrophoresis, the polypeptides in 
the gel were electroblotted onto a piece of nitrocellulose 
membrane and incubated sequentially with BT1 anti- 
body and with the secondary antibody, goat anti-rabbit 
IgG-alkaline phosphatase conjugate (Immun-Blot Assay 
Kit, Bio-Rad Laboratories). Finally, the target polypep- 
tide(s)-primary and secondary antibody conjugates were 
detected using the color reaction agent, 5-bromo-4- 
chloro-3-indolyl phosphate and nitro blue tetrazolium 
chloride (Immun-Blot Assay Kit, Bio-Rad Laborato- 
ries). The primary antibodies were raised against a fu- 
sion protein formed from 56 amino acids of the carboxyl 
terminus encoded by the Btl cDNA and glutathione-S- 
transferase (Sullivan and Kaneko 1995). Each western 
blot was repeated at least 3 times. 

RNA isolation and northern blotting 

Total RNA was isolated from the endosperm of develop- 
ing kernels and suspension- cultured endosperm cells by 
the hot phenol procedure of de Vries et al. (1988). The 
pBtlcDNAl.7 plasmid (Sullivan et al. 1991) was trans- 
formed into E. coli bacteria (XL1) by the rapid freezing 
method (Takahashi et al. 1992) and produced by the 
method of Lee and Rasheed (1990). The Btl cDNA 
probe was prepared from the purified Btl cDNA inserts 
using the Random Primed DNA Labeling Kit (Boe- 
hringer Mannheim Biochemicals, Indianapolis, IN, 
USA). RNA was separated in 1% agarose gel in 3-(N- 
morpholino)propanesulfonic acid (MOPS) buffer (Four- 
ney et al. 1990). The RNA integrity and equal-loading of 
the samples were confirmed by examining band sharp- 
ness and staining intensity of the 28S and 18S rRNA 
bands on the ethidium-bromide- stained gel (Ausubel et 
al. 1993, Cao et al. 1996). The RNA was then trans- 
ferred to a piece of nylon membrane (Hybond™-N, Am- 
ersham International pic, Amersham, UK) by capillary 



blotting (Sambrook et al. 1989). Following M 
crosslinking, the blot was prehybridized for 1 to 11 
65°C in a buffer containing 5x SSPE (lx SSPE: 0.1 
NaCl, 10 mM sodium phosphate, pH 7.7, 1 mM EDJ 
(Ausubel et al. 1993), 5x Denhardt's solution (Au|f 
et al. 1993), and 0.5% (w/v) SDS, and hybridized f|| 
to 20 h at 65 °C After hybridization, the blot was wall 
twice for 10 min each at room temperature in 2x sl| 
containing 0.1% SDS, once for 15 min at 65°C iff 
SSPE containing 0.1% SDS solution, and finally ff| 
min at 65 °C in 0.1 x SSPE containing 0.1% SDS 1 " 
autoradiography. Northern hybridizations were rep||| 
at least 3 times. 



DNA isolation and Southern blotting 

DNA was isolated from developing kernels and|| 
suspension-cultured endosperm cells following thej) 
tocol of Junghans and Metzlaff (1990). Following EM 
digestion, DNA fragments were separated on 0.7%f| 
ose gel in TAE buffer (40 mM Tris-acetate, lit 
EDTA), and transferred to a piece of Nylon-N + 
brane with 0.4 M NaOH for 18 to 20 h by capillary t 
fer (Sambrook et al. 1989). The blot was prehybrij| 
at 65 °C for 4 h in a buffer containing 0.5 M Na 2 F~ 
pH 7.2, 7% SDS, 100 pg ml" 1 salmon sperm :" 
(Sigma), and then hybridized at 65°C for 16 to 201 
the same buffer plus freshly denatured Btl cDNA 
prepared as above for northern blotting. After hybridy 
tion, the blot was washed at room temperature for f|j 
in 2x SSC (lx SSC: 0.15 M NaCl, 15 mM sodiumi 
rate, pH 7.0) (Ausubel et al. 1993) plus 0.5% SDW 
min in lx SSC plus 0.1% SDS, 30 min in 0.1 x SS(| 
0.5% SDS, and finally at 65°C for 30 min in 0.1x|| 
plus 0.5% SDS before autoradiography. 

Results 

Characterization of starch accumulation in A636endosg| 
cultures 

Before we examined Btl gene expression in mai2|p 
dosperm suspension-cultured cells, we characteM 
starch accumulation in the A636 derived endospenn^ 
tures. After a 7-day culture cycle, the fresh weight on 
endosperm cultured cells increased from approxirria| 
2.1 g to 11.2 g (80 ml suspension)" 1 (Tab. 1). ~ 



Tab. 1. Starch accumulation in cells from A636 endosperm suspension cultures. A636 endosperm cell cultures were subcultured intoj 
daid medium with or without CCC as described in Materials and methods. Cultures inoculated at a start density of approximately^ 
flask - ' were grown for 7 days. Data are the average of two flasks plus or minus the SE. 
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Measurement 


Control 


+ CCC 


% of ControI _^^H 


Wmf' — 




Fresh weight (g flask"') 

Total polysaccharides (mg flask -1 ) 

Starch (mg flask"') 

Structural polysaccharides (mg flask -1 ) 
a-Amylase (umol min" 1 flask j 


11.2 ±0 
274.1 ±18.2 
169.8 ±11.9 
104.3 ±6.3 

19.7 ±1.6 


13.0 ±0.2 
376.1 ±28.8 
225.1 ± 14.8 
151.0 ±14.0 

17.6 ±0.2 


H6.1 iiiii 

137.2 3fHH 

144.8 wB 

132.6 £Bb| 
89.3 


^^^ogenate 
^P^Jamybplast 
IliiiBSSS amyloplast 

H^^aitment 


18. 
1. 
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Itures were subcultured int§ 
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% of Control §j 

116.1 
137.2 
144.8 
132.6 
89.3 



^saccharides made up about 2.4% of the fresh 
&|ht. with starch accounting for 62% of the total 
^saccharides (Tab. 1). CCC (100 uM), an inhibitor of 
TBerellic acid (GA 3 ) biosynthesis (Rademacher 1991), 
||ed an increase in fresh weight (16%), starch (45%), 
jjftructural polysaccharides (32%), but a-amylase ac- 
m ; remained essentially unchanged (Tab. 1). These re- 
vindicate that A636 endosperm cultures were com- 
gft for starch biosynthesis and accumulation. 



ation of amyloplast membranes from A636 endosperm 

(Kfres 

;ently reported that BT1 is a family of abundant 
jptides specific to amyloplast envelope mem- 
j|s (Cao et al. 1995). Amyloplasts and amyloplast 
ibranes were isolated from A636 suspension-cul- 
H endosperm cells to confirm the presence of BT1 in 
Ibranes of in vitro cultured maize endosperm cells, 
jylpplasts in these cultures were smaller than those in 
pally developing maize kernels and in addition to the 
loplasts in the pellet substantial quantities of amylo- 
j§ were also located at the interfaces of the 20/30% 
|p/40% Percoll layers during the discontinuous gra- 
ijlpurification. Thus for this study amyloplasts from 
)o interfaces were combined with those in the pel- 
[arker enzyme analyses (Tab. 2) indicate that the 
IBined "purified amyloplast fraction" was free of the 
Isol marker, alcohol dehydrogenase, and was only 
jfitly contaminated by mitochondria (1.6% of the total 
j||hrome c oxidase activity), ER (1.4% of the total 
)H-cytochrome c reductase activity), and rnicrobod- 
g.4% of the total catalase activity). 
||l!owing removal of starch granules and the stromal 
|pns, the membrane fractions derived from the puri- 
|amyloplasts were yellow in color, a positive marker 
tyloplast envelopes from maize endosperm (Cao et 
l|?5), potato tubers (Fishwick and Wright 1980), and 
"lore suspension-cultured cells (Alban et al. 1988, 
linasut et al. 1988). In addition since the "amyloplast 
ijbrane fraction" was isolated from purified amyio- 
| it is concluded that this membrane fraction is 
fily enriched in amyloplast membranes. 
& 



bti n 

(39-44 kDa) 





Fig. 1. Western blot analysis of BTI in amyloplast membranes 
isolated from the endosperm of 12- to 15-DPP A636 kernels and 
from suspension-cultured A636 endosperm cells. Each lane was 
loaded with 4 pg of amyloplast membrane protein. 

BTI not detected in amyloplast membranes from A636 
endosperm cultures 

As reported for amyloplast membranes from W64A 
(Cao et al. 1995), amyloplast membranes isolated from 
the endosperm of 12- to 15-DPP A636 kernels contained 
polypeptides (39 to 44 kDa) which reacted with the BTI 
antibodies (Fig. 1). To our surprise, amyloplast mem- 
branes isolated from suspension-cultured maize en- 
dosperm cells lacked detectable levels of BTI antibody- 
reacting polypeptides (Fig. 1). 



Comparison of Btl gene expression in A636 endosperm cells 
growing in vivo and in vitro 

Previously we reported that immunoblots (westerns) of 
polypeptides from amyloplast membranes and total mi- 



b|2. Purity of amyloplasts isolated from A636 endosperm cultures. Amyloplasts were isolated from A636 endosperm cultures as de- 
|b|d in Materials and methods. Protein and marker enzymes were assayed following filtration through 10-um pore-sized nylon (homo- 
gpate), in the first 100 g centrifugal ion pellet (crude amyloplast), and in the 100 g Percoll density gradient fraction (purified amyloplast). 

nples were suspended in homogenization buffer and lysed by one freeze and diaw cycle before protein content and marker enzyme 
Ipyties were determined. Data are the mean plus SE of 2 determinations and are presented as nmol or umol per minute per gram of fresh 
* ^jt of endosperm cells as indicated. * Cyanide-insensitive portion. 



Protein 
% 



Jjgggenate 
^amyloplast 
jfied amyloplast 



100 
18.9 ±2.0 
1.3 ±0.1 



>wjaiuui/-ni^iiauin J put uuji. 

Alcohol dehydrogenase Cyt-c oxidase NADH-cyt-c reductase* Catalase 

(nmol g - ' FW min - ') (nmol g - ' FW min -1 ) (nmol g" 1 FW min"') (umol g~' FW min -1 ) 

i«ixinn w c 4. n tot n a. mo « iaoa-th 



163.2 ± 10.0 
10.5 ± 1.0 
0 

Cytosol 



30.8 ±1.7 
7.2 ±1.7 
0.5±0.1 

Mitochondria 



387.7 ± 108.8 
125.7 ±20.0 
5.3 ±0.8 

ER 



24.9 ±2.7 
13.2 ±0.6 
1.1 ±0.1 

Microbodies 
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Fig. 2. Western and northern blot analysi 
BT1 and Btl gene expression, respective])® 
extracts from A636 kernels and suspense 
cultured A636 endosperm cells. A, Wesic) 
blot analysis of BT1 in microsomal membraf 
from A636 kernels and cells from A636 enS 
sperm suspension cultures. Each lane SB 
loaded with 30 pg of microsomal membra 
protein. B, Northern blot analysis of Btl tn 
scripts in A636 kernels and cells from Al| 
endosperm suspension cultures. Left and nl 
lanes were loaded with 12 ug of total R||| 
mid-die lane with 4 ug of total RNA. J?L 



crosomal membranes have identical banding patterns 
when reacted with antibodies to BT1 (Cao et al. 1995). 
Thus to increase the total yield of amyloplast mem- 
branes we isolated microsomal membranes from the en- 
dosperm of A636 kernels 10 and 20 DPP and from the 
suspension-cultured endosperm cells, originally initiated 
by culturing endosperm from 10-DPP A636 kernels. 
BT1 antibody-reacting polypeptides were present in mi- 
crosomal membranes from 20-DPP kernels but were not 
detected in microsomal membranes from 10-DPP A636 
kernels or from A636 suspension-cultured endosperm 
cells (Fig. 2A). The lack of detectable BT1 in microso- 
mal membranes from 10-DPP kernels and A636 en- 
dosperm cultures suggests a possibility that cell cultures 
derived from 10- or 12-DPP kernels retained the same 
developmental stage as cells in the endosperm of young 
kernels. To test this possibility, we isolated total RNA 
from samples identical to those used for the western blots 
in Fig. 2A. Northern blot analysis indicated that high lev- 
els of Btl transcripts were detected in both 10- and 20- 
DPP kernels (Fig. 2B) while only a very weak band was 
detected in extracts from the cell culture samples. (This 
very weak band, although not visible in Fig. 2B, may be 
seen in Fig. 3B.) Southern blotting analysis showed DNA 
fragments of similar size in developing kernels and in 
cells from endosperm cultures hybridized with the Btl 



Cell Culture 



cDNA probe (data not shown), suggesting that thejl 
gene was present in the endosperm cultures. These 
suits indicate that the Btl gene may be differential^ 
pressed in endosperm cells growing in vivo and in viti 

Btl gene products are not induced by sucrose or CCC m 

Starch accumulation in suspension-cultured maiz 
dosperm cells is enhanced by the addition of CCC iW 
medium (Tab. 1) and by supplementation of the culjf 
with additional sucrose (Chu and Shannon 1975). Tog 
termine if increased starch synthesis in cultures waff 
related with Btl gene expression, we examined thjj 
fects of sucrose, CCC, and sucrose plus CCC on] 
gene expression in suspension-cultured maizejg 
dosperm cells. None of the treatments designed ffi 
crease starch synthesis in the maize endosperm cull 
affected gene expression (Fig. 3B) or translation^ 
3A). 



Comparison of Btl gene expression in waxy endospernrjM] 
growing in vivo and in vitro 

To determine if the reduced Btl gene expression J| 
characteristic of the 7 -year-old A636 endosper 
hires, we investigated Btl gene expression in su|| 



B 
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Fig. 3. Btl gene expression and translation in A636 suspension-cultured endosperm cells following supplementation with sy 
and the addition of CCC. A, Western blot analysis of BT1 in microsomal membranes from A636 endosperm suspension cun^ 
cells. Polypeptides from microsomal membranes of a 20-DPP kernel are included as a control. Each lane was loaded with 3^ 
microsomal membrane protein. B, Northern blot analysis of Btl transcripts in cells from A636 endosperm suspension culture 
20-DPP kernel (control). Each lane was loaded with 12 pg of total RNA. 
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M Western and northern blot analyses of 
fp^ £ t l gene expression, respectively, in 
llvfrorn wx mutant kernels and suspens- 
Wtur^d endosperm cells initiated from wx 
W A Western blot analysis of BTl in mi- 
fai membranes from wx kernels and cells 
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line protein. B, Northern blot analysis of 
Inscripts in wx kernels and cells from wx 
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llqitured wx endosperm cells. These cultures were 
3 years before use, from 12-DPP endosperm tis- 
M the wx mutant genotype in an A636 background. 
Jfevels of BTl and Btl transcripts were detected by 
jjjg£ and northern blotting, respectively, in en- 
flm from normally developing 14-DPP wx kernels, 
' |d%BT1 (Fig. 4A) and only a trace of Btl message 
IKb) was detected in extracts from the wx en- 
Wm cultures. Genomic Southern blotting showed 
Re Btl gene was conserved in the wx endosperm 
§r|s (data not shown). 
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ilfhe product of Btl , a nuclear encoded gene (Sulli- 
ffal. 1991), appears to have an important function 
W biosynthesis of starch in normally developing 
^kernels because in its absence, i.e. in kernels of 
i|f mutant genotype, starch accumulation is reduced 
||ut 80% (Nelson and Pan 1995, Tobias et al. 1992). 
pearlier paper (Cao et al. 1995) we reviewed evi- 
If supporting the hypothesis that BTl is an adenylate 
locator and suggested that it may function in the 
ifer of ATP, a substrate for AGPase. Although BTl 
Bits for about 40% of the polypeptides in amylo- 
Imembranes from maize endosperm, membranes 
tfphloroplasts and tissues other than maize en- 
jf|m, lack detectable quantities of BTl polypeptide 
Met al. 1995). If BTl is an adenylate translocator 
p^functions in the transfer of ATP into amyloplasts 
jfchange for ADP, a product of starch synthase, then 
Jjpplast membranes, regardless of the tissue source, 
SaW be expected to contain an adenylate translocator. 
W either amyloplasts from other tissues may contain 
p| homolog not recognized by the BTl antibodies 
a|n this study or maize endosperm amyloplasts have 
|||iue amyloplast membrane polypeptide functioning 
**H transfer of substrates for starch synthesis. 
IjAGPase is localized in the amyloplast stroma as 
fally accepted (Okita 1992, Smith et al. 1995), the 
Ipplast membranes must also contain a carbohy- 
l translocator. Amyloplasts from several nonphoto- 
Ketic tissues have been shown to take up hexose-Ps 
pe as substrates for starch synthesis (Borchert et al. 
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1989, Heldt et al. 1991, Hill and Smith 1991, Kosegar- 
ten and Mengel 1994, Schott et al. 1995). Amyloplasts 
isolated from young maize endosperm readily took up 
and converted ADP-Glc to starch, but they were not 
able to take up hexose-Ps to be used as substrates for 
starch synthesis (Liu et al. 1992). Thus the preferred 
carbohydrate transferred into plastids to provide sub- 
strate for starch synthesis varies with the tissue and spe- 
cies. In fact, the specificity of translocators may be 
modified by environmental or experimental conditions. 
For example, spinach chloroplasts specifically transfer 
triose-Ps in exchange for inorganic phosphate (Flugge 
and Heldt 1991), but it was recently shown that ex- 
tended incubation of spinach leaves in glucose induced 
formation of a phosphate translocator capable of trans- 
ferring Glc-6-P (Quick et al. 1995). In addition, amylo- 
plast membranes from developing potato tubers specifi- 
cally transferred Glc-6-P, and not Glc-l-P, in exchange 
for Pj (Schott et al. 1995) while amyloplast membranes 
from potato tuber cells grown in suspension cultures 
transferred Glc-l-P rather than Glc-6-P (Kosegarten 
and Mengel 1994). 

Carbohydrate metabolism in suspension-cultured 
maize endosperm cells differs from that in endosperm 
cells of intact kernels. For example, the cultured cells 
produce transient starch; i.e. low quantities of starch ac- 
cumulate early in the culture cycle and are then metabo- 
lized as the medium sugar is depleted (Chu and Shannon 
1975). High a-amylase activity was observed in the me- 
dium and in the suspension-cultured maize endosperm 
cells (G. Mohabir, K.-C. Liu, C. D. Boyer and J. C. 
Shannon, unpublished data), but relatively little a-amy- 
lase activity was detected in the endosperm of develop- 
ing kernels (Pharis and King 1985). Yu et al. (1991) re- 
ported that starvation of rice suspension cultures for only 
2-4 h induced a-amylase gene expression. The quantity 
of transient starch in the maize endosperm cultures was 
increased by supplementation of the cultures with addi- 
tional sugar (Chu and Shannon 1975) and the addition of 
CCC (Tab. 1). The results shown here indicate that the 
Btl gene present in suspension-cultured maize en- 
dosperm cells was only very weakly expressed and no 
BTl product was detectable in membranes from control 
or sucrose/CCC-supplemented cultures. Thus starch ac- 
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cumulation in suspension-cultured maize endosperm 
cells was not dependent on the presence of BT1 in the 
amyloplast membranes. At present the reasons for differ- 
ential Btl gene expression in maize endosperm cells 
growing in vivo and in vitro are unknown. However, the 
in vitro-grown maize endosperm cells should be useful 
for studies designed to identify and characterize alterna- 
tive membrane polypeptides functioning in the transport 
of substrates for starch synthesis. 

Acknowledgments - We thank Dr Thomas D. Sullivan (Univer- 
sity of Wisconsin, Madison) for the BT1 antibody and the 
pBtlcDNA1.7 plasmid, DrKang-Chien Liu, Christina Richards, 
Dane K. Fisher, Ming Gao and Kyung-Nam Kim for technical 
assistance. 



References 

Alban.C, Joyard, J. & Douce, R. 1988. Preparation and charac- 
terization of envelope membranes from nongreen plastids. - 
Plant Physiol. 88: 709-717. 

Ausubel, F. M., Brent, R., Kingston, R. E., Moore, D. D., 
Seidman, J. G., Smith, J. A. & Struhl, K. 1993. Current Pro- 
tocols in Molecular Biology. - Greene Publishing Associates 
Wiley Interscience, New York:, NY. ISBN 0-471-50338-X. 

Borchert, S., Grobe, H. & Heldt, H. W. 1989. Specific transport 
of inorganic phosphate, glucose 6-phosphate, dihydroxyace- 
tone phosphate and 3-phosphogly cerate into amyloplasts 
from pea roots. - FEBS Lett. 253: 183-186. 

Boyer, C. D., Shannon, J. C. & Hardison, R. C. 1989. Physiol- 
ogy, Biochemistry, and Genetics of NongTeen Plastids. - 
Am. Soc. Plant Physiologists, Rockville, MD. ISBN 0- 
943088-15-1. 

Cao, H., Hu, Z. A., Tan, K. & Jin, Z.-H. 1996. DNA homo- 
logues in some species and expression patterns in cucumber 
organs of genes corresponding to cDNAs from carrot so- 
matic embryogenesis cultures. - J. Plant Physiol. (In press). 

- , Sullivan, T. D., Boyer, C. D. & Shannon, J. C. 1995. Btl, a 
structural gene for the major 39-44 kDa amyloplast mem- 
brane polypeptides. - Physiol. Plant. 95: 176-186. 

Chu, L.-J. C. & Shannon, J. C. 1975. In vitro cultures of maize 
endosperm - a model system for studying in vivo starch bio- 
synthesis. - Crop Sci. 15: 814-819. 

de Vries, S., Hoge, H. & Bisseling, T. 1988. Isolation of total 
and polysomal RNA from plant tissues. - In Plant Molecular 
Biology Manual (S. B. Gelvin, R. A. Schilperoort and D. P. 
S. Verm a, eds), pp. 1-13. Kluwer Academic Publishers, 
Dordrecht. ISBN 0-7923-0236-2. 

Doehleit. D. C. & Duke, S. H. 1983. Specific determination of 
a-amylase activity in crude plant extracts containing ^-amy- 
lase. - Plant Physiol. 71: 229-234. 

Dubois, M., Gilles, K. A., Hamilton, J. K., Rebers, P. A. & 
Smith, F. 1956. Colorimetric method for determination of 
sugars and related substances. - Anal. Chem. 28: 350. 

Felker, C. F. 1987. Ultrastructuie of maize endosperm suspen- 
sion cultures. - Am. J. Bot. 74: 191 2-1920. 

- & Goodwin, J. C. 1988. Sugar uptake by maize endosperm 
suspension cultures. - Plant Physiol. 88: 1235-1239. 

- , Miemyk, J. A. & Crawford, C. G. 1989. Characterization 
of maize en do sperm -derived suspension cells through the 
culture cycle and enhancement of tissue friability. - Plant 
Cell Tissue Organ Cult. 18: 153-165. 

Fishwick, M. T. & Wright, A. J. 1980. Isolation and character- 
ization of amyloplast envelope membranes from Sofanum 
tuberosum. - Phytochemistry 19: 55-59. 

Flugge, U.-L & Heldt, H. W. 1991. Metabolite translocators of 
the chloroplast envelope. - Annu. Rev. Plant Physiol. Plant 
Mol.Biol. 42: 129-144. 



672 



Fourney, R. M., Miyakoshi, J., Dy, R. S. Ill & Paterson, , 
1990. Northern blotting: efficient RNA staining and tral 
-Focus 10: 5-7. ps 

Hannah, L. C, Giroux, M. & Boyer, C. D. 1993. Biotech^, 
cal modification of carbohydrate for sweet maize impl 
ment. - Sci. Hortic. 55: 177-197. | 

Harinasut, P., Akazawa, T., Zhou, F.-G. & Takabe, T.M 
Polypeptide compositions of stroma, internal membnt 
and inner and outer envelope membranes of amylo? 
from suspension-cultured cells of sycamore (Acer 
platanus L.). - Plant Cell Physiol. 29: 1315-1321. § 

Heldt, H. W, Flugge, U.-L & Borchert, S. 1991. Divert] 
specificity and function of phosphate translocators in| 
ous plastids. - Plant Physiol. 95: 341-343. f 

Hill, L. M. & Smith, A. M. 1991. Evidence that glucose 6§ 
phate is imported as the substrate for starch synthesis H 
plastids of developing pea embryos. -Planta 185: 91-Jl 

Hodge, J. E. & Hofreiter, B. T. 1962. Determination of redw 
sugars and carbohydrates. - In Methods in Carhop 
Chemistry (R. L. Whistler and M. L. Wolfram, eds)A\" 
pp. 380-394. Academic Press, New York, NY. ISBN? 
746206-6. 1 

Junghans, H. & Metzlaff, M. 1990. A simple and rapid mp 
for the preparation of total plant DNA. - Biotechniqf* 

Keegstra, K., Olsen, L. J. & Theg, S. M. 1989. ChloroplasUj 
cursors and their transport across the envelope membra? 
Annu. Rev. Plant Physiol. Plant Mol. Biol. 40: 47l-50j| 

Kosegarten, H. & Men gel, K. 1994. Evidence for a glucf| 
phosphate translocator in storage tissue amyloplastsfl 
tato (Solanum tuberosum) suspension-cultured cells. 
iol. Plant. 91: 111-120. 

Lee, S.-Y. & Rasheed, S. 1990. A simple procedure fori 
mum yield of high-quality plasmid DNA. -Biotechnioj 
676-679. f 

Li, H.-M., Sullivan, T. D. & Keegstra. K. 1992. Informati^ 
, targeting to the chloroplastic inner envelope membra; 
contained in the mature region of the maize Btl ' 
protein. - J. Biol. Chem. 267: 18999-19004. 

Liu, K.-C, Boyer, C. D. & Shannon, J. C. 1992. Carbohl 
transfer into isolated maize (Zea mays L.) amyloplast 
stract No. 234). - Plant Physiol. 99: S-39. Jf 

Mangelsdorf, P. C. 1926. The genetics and morphology oj 
endosperm characters in maize. - Conn. Agric. ExBj 
Bull. 279: 509-614. f 

Nelson, O. & Pan, D. 1995. Starch synthesis in mail 
dosperms. - Annu. Rev. Plant Physiol. Plant Mol. Bij; 
475-496. I 

Okita, T. W. 1992. Is there an alternative pathway for staff 
thesis? - Plant Physiol. 100: 560-564. I 

Pharis, R. P. & King, R. W. 1985. Gibberellins and reprodl 
development in seed plants. - Annu. Rev. Plant Physiol 
517-568. f! 

Preiss, J. 1991. Biology and molecular biology of starch^ 
thesis and its regulation. - In Oxford Survey of Pl^g! 
lecular and Cellular Biology, Vol. 7 (B. J. Miflin, 
59-114. Oxford University Press, Oxford. ISBNf 
857750-8. jL 

Quick, W. P., Scheibe, R. & Neuhaus, H. E. 1995. Induct^ 
hexose-phosphate translocator activity in spinach cgi 
plasts.- Plant Physiol. 109: 113-121. ' | 

Rademacher, W. 1991. Inhibitors of gibberellin biosynthe'sj 
plications in agriculture and horticulture. - In Gibhef 
(N. Takahashi, B. O. Phinney and J. MacMillan, cd? 
296-310. Springer- Verlag, New York, NY. ISBNli 
97259-5. | 

Sambrook, J., Fritsch, E. F. & Maniatis, T. 1989. Mol 
Cloning: A Laboratory Manual, 2 Ed. - Cold Spring || 
Laboratory, Cold Spring Harbor, NY. pp 18.47- 1S.74£ 
0-87969-309-6. ' I 

Saravitz, C. H. & Boyer, C. D. 1987. Starch characterisg 
cultures of normal and mutant maize endosperm. - 
Appl. Genet. 73: 489-^95. 

11 
ma 

Physiol. Plant. 9?|| 



||ell t D. J.,Kessler,K & 
"laments of the chloropla 
Jn ! ce266: 1007-1012. 
jfet, K., Borchert, S., Mul 
iCransport of inorganic 
SKospnates across the ei 
Priyloplasts. - Planta 19* 
Prion, J. C. 1982. Maize 
§8k>l°S ical ^search (W 
%fant Molecular Biolog; 
JjSBN 0-9608758-0-8. 
PEcheverria, E. & Boyt 
plasts from developing e: 
M/lethods Enzymol. 148: 
§|pien, F.-M. & Liu, K.-( 
^jgars in developing ma: 
mjesis of ADP-glucose 
**|5-843. 

fc iA. M., Denver K. & N 
iniount and structure o: 
pysiol. 107: 673-677. 
|t|an, T. D. & Kaneko, \ 
fcbtfes amyloplast mem 
KM84. 



J$v.. 



^•byT.M.Mui-phy 
Pliant. 97, 1996 



Best Available Copy 



, R. S. Ill & PatersonlL 
nt RNA staining an d tlfrf 

r, C. D. 1993. Biotech, 
tte for sweet maize wh 

i, F.-G. & Takabe, f| 
stroma, internal menS 
; membranes of anjyi^, 
: of sycamore {AcerYrkm 
iol 29: 1315-1321. M 
rchert, S. 1991. Divel 
3Sphate trans locators^ 
5:341-343. M 
ividence that glucose^!! 
ite for starch synthesis! 
ryos. -Planta 185: 9ifs 
i2. Determination of 
} n Methods in Carboff 
M. L. Wolfrom, eds)V 
New York, NY. ISB§ 

I. A simple and rapidlfnei 
int DNA. - Biotechm§~ 

5. M. 1989. Chloroplall 
ss the envelope membW 
: Mol. Biol. 40: 471-5|HL 
'4. Evidence for a gluco! 
ige tissue amyloplasts|; 
ension-cultured cells.f 



D - J " Kessler * F - & Blobel » G - l994 - Isolation of com- 
^feonents of the chloroplast protein import machinery. - Sci- 
mLt 266: 1007-1012. 

Wit Borchert, S., Muller-Rober, B. & Heldt, H. W. 1995. 
transport of inorganic phosphate and C3- and C6-sugar 
jk^sphates across the envelope membranes of potato tuber 
^Syloplasts. - Planta 196: 647-652. 
irion J C. 1982. Maize endosperm cultures. -In Maize for 
Jiological Research (W. F. Sheridan, ed.), pp. 397^100. 
*RJant Molecular Biologv Association, Charlottesville, VA. 
ISBN 0-9608758-0-8. 

SEcheverria, E. & Boyer, C. D. 1987. Isolation of amylo- 
Ifests from developing endosperm of maize {Zea mays L.). - 
Methods Enzymol. 148: 226-234. 

Jff»ien, F.-M. & Liu, K.-C. 1996. Nucleotides and nucleotide 
lllgars in developing maize {Zea mays L.) endosperms: Syn- 
desis of ADP-glucose in brittle-l. - Plant Physiol. 110: 
|35-843. 

mi A. M., Denyer K. & Martin C. R. 1995. What controls the 
%iriount and structure of starch in storage organs. - Plant 
iRhysiol. 107: 673-677. 

$iyan, T. D. & Kaneko, Y 1995. The maize brittlel gene en- 
codes amyloplast membrane polypeptides. - Planta 196: 
17-484. 



- , Strelow, L, I., Illingworth, C. A., PhiUips, R. L. & Nelson, 
O. E. Jr. 1991. Analysis of maize Briiile~\ alleles and a de- 
fective suppressor-mutator-induced mutable allele. - Plant 
Cell 3: 1337-1348. 

Takahashi, R., Valeika, S. R. & Glass, K. W. 1992. A simple 
method of plasmid transformation of E. coli by rapid freez- 
ing. - Biotechniques 13: 711-715. 

Tobias, R. B., Boyer, C. D. & Shannon, J. C. 1992. Alteration in 
carbohydrate intermediates in the endosperm of starch-defi- 
cient maize {Zea mays L.) genotypes. - Plant Physiol. 99: 
146-152. 

Villand, P. & Kleczkowski, L. A. 1994. Is there an alternative 
pathway for starch biosynthesis in cereal seeds? - Z. Natur- 
forsch 49c: 215-219. 

Wentz, J. B. 1926. Heritable characters in maize. XXVI-con- 
cave. - J. Hered. 17: 327-329. 

Yu, S.-M., Kuo, Y.-H., Sheu, G., Sheu, Y.-J. & Liu, L.-F. 1991. 
Metabolic derepression of a-amylase gene expression in 
suspension-cultured cells of rice. - J. Biol. Chem. 266: 
21131-21137. 



i simple procedure fori 
mid DNA. -Biotecliniq| 

stra, K. 1992. Informally 
inner envelope membrj 
■n of the maize Br7-en$ 
18999-19004. 
on, J. C. 1992. Carbofi| 
la mays L.) amyloplasts! 
L99:S-39. 
:tics and morphology 
£. - Conn. Agric. Exjp 

arch synthesis in maig 
Physiol. Plant Mol. Bio 

native pathway for starcff 
50-564. 
Gibberellins and reproducl 
Annu. Rev. Plant Physioj| 

ecular biology of starc§| 
Oxford Survey of PlanK 
Vol. 7 (B. J. Miflin, edll 
Press, Oxford. ISBN ]0f 

taus, H. E. 1995. Inducting 
r activity in spinach cnh 
3-121. §L 
■f gibberellin biosynmesypl 
lorticulture. - In Gibbertjf 
and J. MacMillan, eds)| 
ew York, NY. ISBN Ojf 

vlaniatis, T. 1989. Mold 
1,2 Ed. -Cold Spring Hafl 
Dr, NY. pp 18.47-18.74. p 

987. Starch characteristic 
it maize endosperm. • 



Physiol. Plant. 97g 



0i 



»d by T.M. Murphy 
pii. Plant. 97, 1996 



673 



Discovering Empirically Conserved Amino Acid 
Substitution Groups in Databases of Protein Families 



Thomas D. Wu and Douglas L. Brutlag 

Department of Medicine, Section on Medical Informatics, 
and Department of Biochemistry 
Stanford University Medical Center 
Stanford, California 94305 
twu@camis.stanford.edu, brutiag@cmgm.stanford.edu 



Abstract 

This paper introduces a method for identifying empirically 
conserved amino acid substitution groups. In contrast with 
existing approaches that view amino acid substitution as a 
pairwise phenomenon, the method presented here identifies 
conserved groups of amino acids using a data structure 
called a conditional distribution matrix. The conditional 
distribution matrix extends the concept of a pairwise 
substitution matrix by changing the context of substitution 
from a single amino acid to a group of amino acids. The 
matrix tabulates information from a database of protein 
families that contains numerous aligned positions. Each row 
in the matrix contains the distribution of amino acids in 
those aligned positions that contain a given conditioning 
group of amino acids. The method converts a database of 
protein families into a conditional distribution matrix and 
then examines each possible substitution group for evidence 
of conservation. The algorithm is applied to the BLOCKS and 
HSSP databases. Twenty amino acid substitution groups are 
found to be conserved empirically in both databases. These 
groups provide insight into biochemical properties that are 
conserved in protein evolution. 

Introduction 

Protein sequences often exhibit variability in their amino 
acid patterns, a phenomenon that can be characterized as 
substitutions of amino acids for one another. Amino acid 
substitution has been viewed largely as a pairwise 
phenomenon. Typically, this phenomenon is represented 
by the frequency that one amino acid replaces another one; 
the entire set of replacement or substitution frequencies is 
organized into a symmetric 20 x 20 substitution matrix 
containing 210 distinct pairwise frequencies. Substitution 
matrices have been studied in depth (Altschul 1991; 
Gonnet, Cohen, & Benner 1992; Jones, Taylor, & Thornton 
1992; Vogt, Etzold, & Argos 1995), and various matrices 
have been proposed, including the well-known accepted 
point mutation (PAM) matrix of Dayhoff and colleagues 
(1978). In many cases, such substitution matrices have 
proven quite useful for comparing, aligning, and exploring 
relationships between pairs of protein sequences. 



However, for groupwise or consensus relationships, 
statistics and methods based on pairwise comparisons are 
often inadequate. The shift from pairwise comparisons to 
groupwise analyses is often challenging and non-trivial, as 
can be seen from the difficulties in trying to align multiple 
sequences (Barton 1990). Sometimes such problems can 
be approached using pairwise methods, but often new 
methods are needed. In computational biology, much 
attention has focused recently on groupwise or consensus 
analyses, such as the classification of families and super- 
families of protein sequences and structures, and the 
compilation of protein family databases, such as PROSITE 
(Bairoch 1991), BLOCKS (Henikoff & Henikoff 1991), and 
HSSP (Sander & Schneider 1991) 

In light of these advances in computational biology, we 
present in this paper an empirical analysis of amino acid 
substitution using a group perspective. We introduce a 
novel method for identifying groups of amino acids that 
substitute for one another with high frequency. Our 
method identifies these substitution groups empirically 
from a collection of multiple sequence alignments. 
Although some researchers have also used multiple 
sequence alignments to study amino acid substitution 
(Henikoff: & HenikofT 1992), their goal has been to derive 
new substitution matrices, whereas our goal is to identify 
substitution groups. 

Various classifications of amino acids into meaningful 
and useful groups have been proposed in other studies, as 
summarized in Table 1. However, previous methods for 
identifying substitution groups have either analyzed 
pairwise data or used theoretical principles rather than 
empirical data. Some researchers have used pairwise 
substitution matrices to infer substitution groups (Dayhoff, 
Schwartz, & Orcutt 1978; Miyata, Miyazawa, & Yasunaga 
1979). A major problem with such an approach is that 
substitutability is not necessarily transitive. That is, even if 
amino acids A and B substitute for each other in some 
contexts and amino acids B and C substitute for each other 
in other contexts, we cannot automatically conclude that 
amino acids A and C substitute for each other. Another 
problem with pairwise analyses is that they are limited in 
their ability to distinguish different biochemical contexts 
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Reference 



Substitution groups 



Dayhoffetal. 1978 
Miyataetal. 1979 
Jimenez-Montano & 
Zamora-Cortina 1981 

Taylor 1986 

Smith & Smith 1990 
Mocz 1995 
Naoretal. 1996 
Klingler 1996 



C, FWY, HKR, DENG, I LMV, AGPST 
C, FWY, HKR, DENQ, I LMV, AGPST 

AG, DE, KR, NQ, ST, FWY, I LMV, CFILMVWY, ADEGHKNPQRST 

Approximately 70 union and intersection combinations of HKR, I LV, ACGS, H FWY, DEHKR, 

ACDGNPSTV, CDEHKNQRSTWY, ACFGHIKLMTVWY 

P, AG, DE, NQ, ST, FWY, HKR, I LV, CFILMVWY, DEHKNQRST 

AEHMQY, FIKLVW, CDGNPRST 

ON, GP, DGNP, EKQR, FI LV, OEKNQR, ACFI LMVWY, DEGHKNPQRST 
H, K, N, AP, CF, OE, GS, KR, AGS, I LV, NQR, QTY, HMTWY, CFI LMVW 



Table 1 Amino acid substitution groups identified in previous studies. Substitution groups are arranged in order of 
increasing size. Singleton groups, consisting of a single amino acid, are listed when the study indicated that the amino acid 
was relatively unlikely to substitute for other amino acids. 



for substitution. The concept of amino acid substitution 
inherendy requires a context; without a context, we merely 
have a marginal distribution of amino acid frequencies. 
Therefore, substitutability essentially consists of rules of 
the form, "When context X is present, amino acid A 
substitutes with frequency f" With pairwise data, the 
context X can be specified only as a single amino acid. In 
contrast, in this paper, we consider more expressive and 
specific contexts that contain groups of amino acids rather 
than a single amino acid. 

Other researchers have proposed substitution groups on 
theoretical rather than empirical grounds (Jimenez- 
Montano & Zamora-Cortina 1981; Kidera et al. 1985; 
Taylor 1986; Smith RF and Smith TF 1990; Mocz 1995). 
These theoretical analyses use measurements of various 
amino acid properties, such as volume, charge, and 
hydrophobicity, and then propose substitution groups that 
should be conserved. Unfortunately, theoretical models 
may not necessarily correspond to the patterns of 
conservation observed empirically. Moreover, amino acid 
properties often depend on particular biochemical 
environments found in protein structures, so properties of 
amino acid in isolation may not reflect the complexities of 
amino acid substitutions in particular contexts. 

In our approach, we analyze each possible substitution 
group on its own merits. Hence, substitution groups may 
overlap or subsume one another. Our approach differs 
from that of some other researchers, who require that 
substitution groups do not overlap. For example, Mocz 
(1995) uses clustering techniques to identify three mutually 
exclusive clusters of amino acids. The mutual exclusion 
requirement means that each amino acid can belong only to 
a single substitution group. Another restriction sometimes 
placed is that the groups must be organized into a strict 
hierarchy (Smith RF & Smith TF 1990) or Venn diagram 
(Taylor 1986). We believe that such requirements are 
unnecessarily restrictive. Each amino acid has several 
properties and can serve different functions, depending on 
the biochemical context. In some contexts, the size of an 
amino acid may be critical; in others, its charge may be the 



conserved property. These different contexts will not 
necessarily fit into a mutually exclusive, strictly 
hierarchical, or set-theoretical scheme. Therefore, in our 
approach, we analyze each substitution group separately for 
empirical evidence of conservation. 

In order to analyze each possible substitution group 
independently, we require more information than is stored 
in a pairwise substitution matrix. We therefore extend the 
idea of a substitution matrix to a larger structure called a 
conditional distribution matrix. This matrix provides the 
foundation for a consensus-based analysis of amino acid 
substitution. In the rest of this paper, we discuss two 
databases of protein families, BLOCKS and HSSP, that 
make a consensus-based analysis possible. We then 
present the concept of the conditional distribution matrix 
and the criteria we use to identify empirically conserved 
substitution groups. We then present independent analyses 
of substitution groups conserved empirically in the 
BLOCKS and HSSP databases. We find that twenty 
substitution groups are conserved in both databases, and we 
propose biochemical characteristics underlying those 
groups. Finally, we discuss various features of our 
approach and suggest how our results may be used in 
further work in computational biology. 

Methods 

Data 

Our method requires a source of aligned positions; this data 
is readily available from databases of protein families or 
multiple sequence alignments. Two of largest and most 
widely used protein family databases are the BLOCKS and 
HSSP databases. Although these databases have distinct 
characteristics, they can still be viewed as collections of 
aligned positions. 

The BLOCKS database (Henikoff & Henikoff 1991) 
contains short, highly conserved regions of protein 
families, represented by ungapped multiple alignments 
called blocks. Blocks are generated from a set of related 
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Figure 1 Comparison of BLOCKS and HSSP databases. The histograms show the number of protein families of different 
sizes in the two databases, where the size of a family may be measured by the number of sequences or aligned positions that 
it contains. The horizontal scales are selected to be the same dimension to facilitate comparison. The BLOCKS database 
actually contains some protein families with as many as 507 sequences, and the HSSP database contains some protein families 
with as many as 1983 aligned positions. 



protein sequences. Conserved regions are then found 
V within these sequences using a motif finding program 
% (Smith HO, Annau, & Chandrasegaran 1990), and the 
edges yf these regions are extended until a similarity score 
falls below some threshold. The similarity score for local 
alignment and extension is based on the BLOSUM 62 
substitution matrix. Finally, a highly scoring set of blocks 
is selected from all possible conserved regions using an 
optimal path algorithm. In our study, we used version 8.0 
of the BLOCKS database (9 August 1994), which contains 
2884 blocks constructed from 770 protein groups in 
PROSITE version 12.0. 

The HSSP (Homology-derived Secondary Structure of 
Proteins) database (Sander & Schneider 1991) combines 
structural data from the PDB (Protein Data Bank) database 
and sequence data from the SWISSPROT database. Each 
HSSP family corresponds to a PDB structure, and contains 
all SWISSPROT sequences that are homologous above a 
certain length-dependent threshold, using the Smith- 
Waterman alignment algorithm and a substitution matrix 
by McLachlan (1971). We used the version of HSSP dated 
16 November 1995, which contains 3569 protein families. 

The BLOCKS and HSSP databases are constructed in 
quite different ways for different purposes. The BLOCKS 
.database aims to describe sequence homology, whereas the 
HSSP database aims to describe structural homology, 
infeiTed from sequence homology. The BLOCKS database 
contains families based on optimally scoring multiple 
sequence alignments of locally conserved regions, whereas 
HSSP contains families based on pairwise sequence 
comparisons over global regions. The BLOCKS database 



does not allow gaps in its alignments, whereas HSSP does. 
The two databases use different substitution matrices for 
computing alignments. 

These differences produce different types of multiple 
sequence alignments, as shown in Figure 1 . A protein 
family can be characterized by the number of sequences 
and aligned positions it contains. In the HSSP database, 
different sequences may contribute to each aligned 
position, so the number of sequences should be averaged 
over all positions. As Figure 1 shows, the number of 
sequences per family in BLOCKS varies widely, ranging 
from 2 to 507 (mean = 15.6), whereas in HSSP, it is 
relatively narrow, ranging from 1 to 70 (mean = 24.9). 
Conversely, the number of positions per family in BLOCKS 
ranges only from 4 to 55 (mean = 32.8), whereas in HSSP, 
it ranges from 12 to 1983 (mean = 266.9). These 
histograms, reflect the stricter requirement that BLOCKS 
places on' each aligned position, requiring it to be 
conserved across all sequences. On the other hand, HSSP 
may include an aligned position that is conserved for some 
but not all homologous sequences. 

Conditional Distribution Matrix 

Our algorithm consists of two steps. First, we convert a 
database of aligned positions, such as those in BLOCKS and 
HSSP, into a large data structure called a conditional 
distribution matrix (CDM). Then, we look for 
statistically significant groups of amino acids within this 
matrix. In this section, we describe the CDM. 
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Figure 2 Conditional distribution matrix. Each row 
corresponds to a conditioning group, which identifies a 
subset of the aligned positions in the database. The entries 
in. each row contain the distribution of amino acids in those 
aligned positions. 

The conditional distribution matrix can be thought of as 
an extension of the 20 x 20 pairwise substitution matrix. 
The pairwise substitution matrix essentially contains 
different contexts for amino acid substitution, where each 
context is a single amino acid. For each context a, the 
pairwise substitution matrix contains the distribution of 
amino acids that substitute for a. Hence, entries in the 
pairwise substitution matrix are substitution frequencies 
/(a' | a) that indicate the likelihood that amino acid a' 
substitutes in the context of amino acid a. If we think of 
the Context as being on the vertical axis, then each row 
contains the distribution for a different context 

If we extend the idea of context from a single amino acid 
to a set of amino acids, we obtain the conditional 
distribution matrix (Figure 2). The contextual set of amino 
acids is called a conditioning group. Each conditioning 
group A corresponds to a subset of the aligned positions in 
the database. To. understand the correspondence, suppose 
that we represent an aligned position as a set P of amino 
acids. The setP is the union of all amino acids in the 
aligned position. Then, we say that an aligned position P 
satisfies conditioning group A if AqP. In other words, 
the aligned position must contain at least one instance of 
every amino acid in the conditioning group. The aligned 
position may, of course, contain other amino acids as well, 
and in fact, it is the distribution of these other amino acids 
that we are interested in. The conditioning group A 
provides a context for substitution, whereas the remaining 
amino acids in (P-A) substitute in that context. 

Note that a given aligned position may satisfy several 
conditioning groups. For instance, consider an aligned 



position that contains 5 occurrences of valine, 4 
occurrences of leucine, and 3 occurrences of isoleucine. 
Then, the aligned position satisfies the conditioning groups 
ILV, I L, IV, LV, I, L, V, and the null set 0. This 
multiple correspondence is appropriate because it is 
difficult to know a priori the biochemical context or 
functionality of an aligned position and hence its 
substitution pattern. For instance, in this example, the 
underlying context might be I LV, with no substitutions Or 
the underlying context might be LV, with isoleucine 
substituting at a high frequency. Or perhaps the context 
might simply be V, with leucine and isoleucine both 
substituting at a high frequency. 

Although we may not be able to infer substitution 
patterns from a single aligned position, sampling over 
several aligned positions may provide statistically 
meaningful answers. The conditional distribution matrix 
accumulates the substitution patterns for various possible 
contexts over numerous aligned patterns. This matrix, 
which is of size 2 20 x 20, has a row for each possible 
conditioning group A. Each row contains the distribution 
of substituting amino acids in the context of a certain 
conditioning group. Each entry in the CDM contains a 
conditional count c(a\A) f which equals the total number 
of occurrences of amino acid a over all aligned positions 
that satisfy conditioning group A. We may think of the 
counting process conceptually as finding all aligned 
positions that satisfy conditioning group A, and then 
tabulating all amino acids in those positions. (In practice, 
however, the CDM is not constructed row by row, but by 
processing each aligned position in a database sequentially. 
For each aligned position P, we add the counts of amino 
acids in P to all rows A, such that AgP.) 

Since the number and sizes of aligned positions 
satisfying each conditioning group varies, we normalize the 
conditional counts to obtain a conditional frequency: 

/(a|A)= C ^ where at A. 
JL c \ a \ A ) 

Note that the normalizing value in the denominator 
excludes amino acids in the conditioning group. This 
avoids the problem of circularity, whereby the count of an 
amino acid in the group is elevated simply because the 
group selects for it. Rather, we are interested in the 
distribution of amino acids outside the conditioning group. 

In order to evaluate these conditional frequencies, we 
require an expected value for comparison. The expected 
frequency comes from the marginal distribution of amino 
acids, that is, the distribution across all aligned positions in 
the database. In fact, if the null set is a considered a 
conditioning group, the marginal counts c(a) will be 
stored in the CDM. The expected conditional frequency 
derives from the marginal counts as follows: 

fl(a\A)= J^ A . where a^A. 

a'eA 
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In addition to first-order statistical characteristics, such 
as the observed and expected frequencies, we may also 
compute second-order statistical characteristics, such as the 
standard error of the pro portion \i\ 

. °< fl|A) -J 

\ a'fA 

Given these first- and second-order quantities, we can 
evaluate an observed conditional frequency / using the 
relative deviate or Z-score. The Z-score indicates the 
number of standard errors d(a\A) that an observed 
frequency f{a\A) differs from the expected frequency 
H(a\A); 

o[a\A) 

The Z-score indicates whether an amino acid is over- or 
under-represented in the context of a given conditioning 
group. If the Z-score is positive, the amino acid is over- 
represented in that context; and if negative, it is under- 
represented. We may imagine that each conditioning group 
induces a frequency distribution on the other amino acids. 
Amino acids with positive Z- scores are positively induced, 
whereas those with negative Z-scores are negatively 
induced. For the purpose of definition, we use three 
standard errors as a threshold: A Z-score greater than 3 
reflects positive induction; a Z-score less than -3, negative 
induction; and between 3 and -3, a neutral effect 

Criteria for Empirical Conservation 

The conditional distribution matrix can be analyzed for 
evidence that a given substitution group is conserved. In 
particular, the Z-scores provide a basis for identifying 
substitution groups that are empirically conserved. 
Intuitively, we consider a substitution group A to be 
conserved empirically if amino acids within the group 
substitute for one another significantly more frequently 
than amino acids outside the group substitute for amino 
acids in A. 

More formally, we consider a group conserved 
empirically if it is both compact and isolated. Compactness 
means that all amino acids in the group substitute for one 
another frequently, and isolation means that amino acids 
outside the group do not substitute for those within the 
group as frequently. We measure substitutability within 
the group as the Z-score of each amino acid conditioned on 
other members of the group, or Z(a | A - {a}) . The overall 
compactness, or compactness score, is the minimum of 
these scores: 

C(A) = min Z(a|A - {a}) 

We measure substitutability of amino acids outside the 
group for those within the group as the Z-score of each 
amino acid conditioned on the group, or Z(a|/l). Because 
a high score indicates that an amino acid outside the group 
should belong to the group, we define the interference 
score to be the maximum of these scores: 



7(A) = rnaxZ(fl|A) 

Finally, we quantify the conservation of a substitution 
group by the difference between its compactness and 
interference scores. We call this the separation score: 

S(A) = C(A)-1{A) 
When a substitution group has a statistically significant 
separation score, we say that the substitution group is 
conserved empirically. We set the threshold for 
significance at three standard errors, which is equivalent to 
a significance level of 0.01. We examined each possible 
substitution group for a separation score greater than three 
standard errors. 

Because we test a large number of substitution groups 
independently, one may ask whether the number of tests 
itself will yield a large number of significant results. 
Surprisingly, the answer is no. Let the size N be defined as 
the cardinality of substitution group A, that is, the number 
of amino acids in A. Consider all possible substitution 
groups of size N; there are "20 choose N" or 
20!/ [AM (20 ~N)\] such groups to be tested. If Z-scores 
are distributed randomly, then a group has a positive 
separation score whenever the Z-scores of the N amino 
acids in the group are all greater than the Z-scores of the 
(20 -N) outside the group. Hence, the probability of 
achieving a positive separation score for a group of size N 
is the permutation of N multiplied by the permutation of 
(20 -W), divided by all permutations of the 20 amino 
acids. This is simply the reciprocal of the number of 
groups of size N. Therefore, among all substitution groups 
of size N, we expect to see one group with a positive 
separation score by random chance. Hence, we need not 
make a provision, such as a Bonferroni correction, for the 
large number of tests. 

Examples 

To gain a better understanding of our method, we look at 
some examples. Consider the substitution group I LV; our 
analysis of this group using the BLOCKS database is shown 
in Table 2(a). The Z-scores for isoleucine, valine, and 
leucine are all show high rates of substitution for one 
another, significantly higher than any other amino acid. 
The closest amino acid that interferes with this group is 
methionine. Although methionine is positively induced by 
I LV, there' is a clear separation of 102.4 standard errors 
between the substitution frequencies of I LV and M, which 
is highly significant. Hence, I LV is conserved empirically 
in the BLOCKS database. Note that I LV positively induces 
the hydrophobic amino acids M, F, T, A, and Y; negatively 
induces Q, C, S, P, W, N, R, E, D, and G; and has a neutral 
effect on H. The group also positively induces K slightly, 
which makes biochemical sense because the long side 
chain of lysine gives it a partially hydrophobic character. 

In contrast with I LV, most substitution groups were not 
conserved empirically. Table 2(b) shows the analysis for 
the group GIM, which scored the lowest among all 
substitution groups of size 3 in the BLOCKS database. G I M 
scores poorly because it is not compact. Glycine 
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(a) Substitution group I LV (5328 positions) 
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Table 2 Analyses of substitution groups I LV, GIM, and F I V. The amino acids in each group are separated from those 
outside the group by a double bar, and then sorted by Zrscore. For each substitution group A and amino acid a, the values to 
the left of the double bar are Z(a | A - [a]) , and the values to the right of the double bar are Z(a | A) . A single vertical bar 
separates amino acids that are positively, neutrally, and negatively induced by the substitution group. 



substitutes only rarely in those positions with both 
isoleucine and methionine, as shown by its under- 
representation of 6L9 standard errors. Hence, GIM is not 
conserved empirically in the BLOCKS database, for reasons 
of non-compactness. 

Another substitution group, F I V , shown in Table 2(c), 
failed pur criterion because it was not isolated. Although 
its compactness score is relatively high, meaning that the 
three amino acids each substitute for one another 
frequently, its interference score is even higher, because 
leucine substitutes in this context even more frequently. 
Therefore, the substitution group F I V is also not conserved 
empirically, for reasons of non-isolation. 

Results 

Our analysis of the BLOCKS database yielded 30 
substitution groups that are conserved empirically, and our 
analysis of the HSSP database yielded 51 substitution 
groups. These substitution groups are listed in Tables 3 
and 4, respectively. Twenty substitution groups are 
conserved empirically in both databases. We feel that the 
validation of these substitution groups by both databases 
provides strong evidence that they are indeed conserved in 
nature. We therefore consider further the biochemical 
characteristics of these substitution groups. 

Of the 190 possible amino acid groups of size 2, nine are 
conserved empirically in both databases. These amino acid 
pairs are not evident immediately from substitution 
matrices. For example, the empirically conserved 
substitution groups have the following BLOSUM 62 scores: 
FY (score of 3), IV (3), ST (I), AS (1), DE (2), KR (2), DN 
(1), EQ (2), and HY (2). Conversely, the BLOSUM 62 
matrix contains several positively scoring amino acid pairs 
that were not conserved empirically in our study: LM, I L, 
and WY (all with scores of 2), and NS, NH, RQ, EK, KQ, IM, 
MV, LV, and FW (all with scores of 1). Hence, results from 
our analysis appear to go beyond substitution matrix data. 



The empirically conserved substitution groups are 
consistent with biochemical intuition. The substitution 
group FY is the most significant in both databases. Both 
phenylalanine and tyrosine have side chains with a single 
aromatic ring, and have similar volume. The group I V 
contains amino acids with aliphatic side chains that branch 
at the beta carbon. The groups DN and EQ are both acid- 
amide combinations with very similar side chains. In 
addition, the two acidic amino acids, D E, are conserved 
empirically. However, the two amides, NQ, do not form a 
substitution group empirically, even though they might 
seem to belong together on theoretical grounds. As we 
shall see later, glutamine tends to cluster more with the 
long-chain polar amino acids, such as lysine and arginine. 
The basic amino acids, KR, are conserved empirically, and 
both have amino groups. The group ST contains amino 
acids that have short hydroxy side chains. Serine is also 
conserved empirically with alanine (AS); both amino acids 
are small, each containing a single carbon atom in its side 
chain. Nevertheless, the smallest amino acid, glycine, does 
not form a group with serine or alanine, perhaps because 
glycine has many distinctive properties. Finally, the group 
HY is conserved empirically in both databases. Both amino 
acids have polar ring structures, so the combination of 
similar volume and polarity appears to account for their 
conservation. 

For amino acid groups of size 3, the two databases 
identified six empirically conserved substitution groups in 
common. The highest scoring amino acid triplet in both 
studies was I LV. All three amino acids in this group have 
branched aliphatic side chains. As we noted previously, 
isoleucine and valine form a substitution group themselves. 
One explanation may be that isoleucine and valine are both 
branched at their beta carbons, whereas leucine is branched 
at its gamma carbon. Apparently, branch position matters 
in some biochemical environments, but not in others. 
Another conserved amino acid triplet is FWY. All three 
amino acids in this group have aromatic side chains, 
although tryptophan has a double ring. Since 
phenylalanine and tyrosine themselves form a substitution 
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Table 3 Substitution groups conserved empirically in the BLOCKS database. Groups are arranged according to their 
size and sorted by separation score. Groups conserved empirically in both BLOCKS and HSSP databases are marked 
with a bullet. For each group, the table lists the number of positions in the database satisfying the group; the 
compactness, interference, and separation scores; and their effect on amino acids outside the group, listed in order of 
descending Z-score. 
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5.8 




IGOWC 




ADEFGHMNPQRSTVWY 


28 


9.4 


1.3 


8.1 




LCIK 





Table 4 Substitution groups conserved empirically in the HSSP database. For explanation, see caption for Table 3. 
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Figure 3 Classification of empirically conserved amino acid substitution groups. This classification contains all substitution 
groups that are conserved empirically in the BLOCKS and HSSP databases. The substitution groups are linked by 
subsumption relationships. Cysteine, glycine, and proline do not belong to any substitution group. 



group, it appears that single-ring aromatic side chains are 
conserved in some contexts, but in other contexts, 
aromaticity itself is conserved. A closely related amino 
acid triplet that is also conserved empirically is FLY. The 
biochemical similarity for this group appears to be volume. 
Both phenylalanine and tyrosine have a bulky aromatic 
group at their gamma carbons, whereas leucine has a 
branched methyl group there. Perhaps in some 
environments, the branched methyl group provides enough 
volume and hydrophobicity to substitute for the aromatic 
ring. The amino acid triplet of AST contains amino acids 
that have short side chains, with either one or two carbons. 
The remaining amino acid triplets, EKQ and KQR, contain 
amino acids with relatively long polar side chains. The 
biochemical basis for conserving both triplets is not 
immediately clear. The two groups both contain lysine and 
glutamine, but one triplet has glutamate and the other has 
arginine. Arginine can donate a hydrogen bond, whereas 
glutamate cannot. In addition, the side chain of arginine is 
much larger than that of glutamate. Perhaps amino acids 
with long polar side chains and hydrogen bond donor 
capability (KQR) are conserved in different biochemical 
environments than amino acids with medium-length polar 
side chains (EKQ). 

The two databases identified only two empirically 
conserved substitution groups of size 4 in common. One 
group, ILMV, is a well-recognized group of small 
hydrophobic amino acids. The other group, EKQR, 
contains amino acids with long polar side chains. This 
group subsumes the triplets EKQ and KQR discussed 
previously. These amino acids have been observed to 
participate in salt bridges on the surfaces of proteins, which 
help stabilize protein structure (Goldman 1995). 

For substitution groups of size 5, the two databases 
identified two empirically conserved groups in common. 
One group, FI LMV, contains what are referred to as the 
major hydrophobic amino acids. The other group, FILVY, 
demonstrates that tyrosine sometimes acts as a hydrophobic 
amino acid. 

Finally, both databases identified the six-member amino 
acid group F I LMV Y as being conserved empirically. This 
group is a combination of the two substitution groups of 



size 5 and contains amino acids with hydrophobic 
characteristics. 

For the larger substitution groups, the two databases 
correlated less well than for smaller substitution groups. 
Conservation of large substitution groups is difficult to 
identify because few aligned positions in the BLOCKS and 
HSSP databases satisfy large conditioning groups. In fact, 
some of these findings are based on fewer than 100 aligned 
positions and may not be reliable. Moreover, aligned 
positions that do satisfy a large conditioning group must 
contain many sequences and many different amino acids, 
meaning that the position may not be conserved well. In 
addition, the large number of sequences means that a few 
protein families could bias the results. As databases grow 
larger, we might expect to obtain more accurate results for 
large groups. 

Nevertheless, the two databases showed a near-match 
with the groups ADEGHKNPQRST in BLOCKS and 
ADEHKNPQRST in HSSP. The two groups differ on 
whether glycine belongs, with BLOCKS including glycine 
and HSSP excluding it. Furthermore, in these larger 
substitution groups, revealing insights can be obtained from 
examining the amino acids that they induce negatively. For 
instance, both databases identify substitution groups that 
negatively induce the hydrophobic amino acids. These 
substitution groups differ slightly between the databases, 
perhaps reflecting the heterogeneity of hydrophilic 
environments. In addition, both databases identify 
substitution groups that negatively induce tryptophan, 
cysteine, and sometimes phenylalanine. These large 
substitution groups might therefore be defined in a negative 
sense, by specifying the absence of certain amino acids. 

Our set of empirically conserved substitution groups is 
generally different from those found in the literature, 
although some similarities can be found. Of the 20 groups 
found in both BLOCKS and HSSP, eight have been 
proposed explicitly in previous analyses: DE, KR, ST, DN, 
ILV, FWY, ILMV, and EKQR. In addition, previous 
analyses have proposed the groups F I L V and 
ADEGHKNPQRST, which are conserved empirically only in 
BLOCKS, and the group DENQ, which is conserved 
empirically only in HSSP. 
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The twenty substitution groups conserved empirically in 
both databases can be organized into a classification 
hierarchy, as shown in Figure 3. In this hierarchy, the 
amino acids are divided into three major classes. One 
class, MIVLFWYH, contains hydrophobic amino acids; 
another class, RKQEDN, contains charged or polar amino 
acids; and the third class, AST, contains small amino acids. 
In addition, three amino acids — cysteine, glycine, and 
proline — do not belong to any substitution group. These 
amino acids have unique properties that cannot be easily 
fulfilled by other amino acids. Cysteine can form disulfide 
bridges. Glycine is the smallest amino acid, having only a 
hydrogen atom for its side chain. And proline has a 
distinctive cyclical side chain that causes it to form bends 
in helices and strands. 



Discussion 

Because our approach to amino acid substitution is 
empirical, it enjoys the same advantages and suffers the 
same limitations as all empirical studies. One feature of 
our approach, which could be viewed as either an 
advantage or limitation, is that our analysis is general. 
Although our analysis conditions on specific groups that 
represent specific biochemical contexts, the substitution 
groups in our study are nevertheless conserved empirically 
across an entire database. In contrast, many models for 
describing conservation, such as motifs (Bairoch 1991), 
profiles (Gribskov, McLachlan, & Eisenberg 1987), and 
hidden Markov models (Krogh et al. 1994), characterize 
specific protein families. In those models, each protein 
family has its own pattern of conservation. The issue is 
whether generalized biochemical contexts exist and 
whether selecting aligned positions across an entire 
database adequately specifies a single biochemical context 

Our opinion is that general patterns of conservation do 
exist, and that understanding them is critical to 
understanding specific protein families. We believe that 
nature is likely to use the same patterns over and over. A 
strategy of finding general patterns of conservation may be 
especially fruitful because most protein families are 
relatively small and the biochemical context of each 
position is not known. By drawing upon a iarge amount of 
data, a general approach is more likely to minimize 
statistical noise and extract meaningful signals. 

Nevertheless, we acknowledge that specific patterns of 
substitution may occur only in specific protein families. 
Unfortunately, selecting those protein families may be 
problematic. An intermediate approach based on 
secondary structure might prove fruitful, since alpha- 
helices and beta-strands likely exist in different 
biochemical environments. Data sets of alpha-helices and 
beta-strands may generate different sets of substitution 
groups, which might otherwise be obscured in the entire 
protein family database. Another strategy would be to 
analyze large families of proteins with similar function, 
such as the globins or kinases. In future work, we plan to 
apply our method to such specialized data sets. 



Empirical studies, such as this one, must also consider 
the issue of sampling bias. Protein families often contain 
sequences closely related protein sequences that are over- 
represented, and distantly related sequences that are under- 
represented. Several methods have been proposed to 
weight the sequences to remove this bias (Altschul, Carroll, 
& Lipman 1989; Sibbald & Argos 1990; Vingron & 
Sibbald 1993). We believe that such weighting methods 
might help to strengthen our results, and we intend to study 
the effect of sequence weighting on our analysis of amino 
acid conservation. 

Another characteristic of our work is that our analysis 
has been biased towards regions that are highly conserved. 
In particular, the BLOCKS database contains regions with 
relatively low rates of mutability. Our analysis therefore 
might miss acceptable substitutions that occur in poorly 
conserved, highly variable regions. However, our goal has 
not been to find acceptable substitution groups that are 
weakly conserved, but rather to identify groups that are 
strongly conserved and have empirical evidence to support 
them. We feel that it is these groups that give the best 
insight into the biochemical principles that are important in 
protein structure and function. 

We anticipate that a set of empirically conserved 
substitution groups may find several potential applications. 
First, such substitution groups may provide the basis for 
new methods for aligning multiple sequences. Most 
existing methods for aligning multiple sequences rely upon 
pairwise substitution frequencies. However, substitution 
groups may provide a more appropriate model for 
groupwise or consensus relationships. Second, substitution 
groups might provide an alphabet to describe discrete 
protein motifs. Most discrete motifs, such as those in 
PROSITE, are constructed manually, although automated 
methods have been developed recently (Wu & Brutlag 
1995). Both automated and manual methods for building 
discrete motifs would benefit from having a set of 
standardized substitution groups. Analogously, our 
analysis might even provide a basis for probabilistic motifs, 
such as hidden Markov models (Krogh et al. 1994). The 
conditional distribution matrix essentially contains 
probabilistic amino acid profiles for various biochemical 
contexts. Selected amino acid distributions from the matrix 
could serve as canonical distributions for hidden Markov 
models. These distributions might provide general, 
idealized models of amino acid substitution instead of the 
empirically tailored distributions based on a specific 
protein family that are currently used. Recent work on 
Dirichlet mixture priors (Brown et al. 1993) also tries to 
find idealized amino acid distributions. Finally, because 
substitution groups attempt to capture important amino acid 
properties, they might be helpful in predicting the 
secondary and tertiary structure of protein sequences. 
Many researchers have tried to generalize amino acid 
sequences in terms of their properties (Bork 1989); 
substitution groups may provide insight into properties that 
are conserved empirically. 
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Aside from these applications, though, we hope that our 
study leads to an improved understanding of amino acid 
substitution. Amino acid substitution is a central principle 
in molecular biology. Improved knowledge about amino 
acid substitution may ultimately lead to better 
understanding of protein structure and function. Patterns of 
amino acid substitution represent static evidence of the 
dynamic process of amino acid evolution and conservation. 
Findings such as those in this study are central to our 
understanding of protein structure, function, and evolution. 
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Functional evolution of an arsenic resistance operon has been accomplished by DNA shuffling, involv- 
ing multiple rounds of in vitro recombination and mutation of a pool of related sequences, followed by 
selection for increased resistance in vivo. Homologous recombination is achieved by random fragmenta- 
tion of the PCR templates and reassembly by primerless PGR. Plasm, d -determined arsenate resistance 
from plasmid pl258 encoded by genes arsR , arsfl, and arsC was evolved in Escherichia co/i. Three 
rounds of shuffling and selection resulted in cells that grew in up to 0.5 M arsenate, a 40-fold increase in 
resistance. Whereas the native plasmid remained episomal, the evolved operon reproducibly integrated 
into the bacterial chromosome. In the absence of shuffling, no increase in resistance was observed after 
four selection cycles, and the control plasmid remained episomal. The integrated ars operon had 
13 mutations. Ten mutations were located in arsB, encoding the arsenite membrane pump, resulting in a 
fourfold to sixfold increase in arsenite resistance. While arsC, the arsenate reductase gene, contained no 
mutations, its expression level was increased, and the rate of arsenate reduction was increased 12-fold. 
These results show that DNA shuffling can improve the function of pathways by complex and unexpected 
mutational mechanisms that may be activated by point mutation. These mechanisms may be difficult to 
explain and are likely to be overlooked by rational design. 

Keywords: combinatorial chemistry, sexual PCR* molecular libraries, metabolic engineering 



DNA shuffling 1 "* is a process for recursive recombination and 
mutation, performed by random fragmentation of a pool of related 
genes, followed by reassembly of the fragments by primerless PCR 
(Fig. 1). An advantage of DNA shuffling over rational design is that 
it can optimize the function of genes without first determining 
which gene product is rate limiting. Our goal is to develop com- 
mercial 'molecular breeding* processes for the efficient optimiza- 
tion of complex traits and with minimal understanding of the gene 
products and their interactions. 

This process has been shown to yield rapid functional opti- 
mization of genes at rates and to an extent not previously possible in 
laboratory conditions. However, all previously investigated exam- 
ples were single genes that were improved by point mutations 2 * 4 " 6 . 

In addition to point mutation we would like to be able to gen- 
erate diversity using a wide variety of mutational mechanisms, 
such as polynucleotide deletion, insertion, and inversion, as well as 
integration and excision. These mutational mechanisms are widely 
used in nature but find only limited application in rational design. 

We have already observed deletions and insertions due to shuf- 
fling (data not shown). We show that DNA shuffling is able to access 
chromosomal integration to obtain rapid functional improvement 
The goal of this work was to demonstrate the applicability of DNA 
shuffling to large, multigene determinants encoding the detoxifica- 
tion of environmentally relevant heavy metals, specifically arsenate. 

The trivalent and pentavalent ions of arsenic inhibit many 
biochemical processes 7 . Vast amounts of arsenate are released in 
industrial processes, including the mining of arsenopyrite gold 
ores, which constitute about one-third of world gold reserves. 
Microbial release of gold from high-arsenic ores is now performed 
on a commercial scale by microbial oxidation and leaching*. Current 
processes involve incubating large batches of the arsenopyrite ore 
in tanks for extended periods, during which time a high level of 



soluble arsenic is released. Untreated, highly toxic arsenic effluent 
has been disposed of into rivers and has ended up in groundwater 7 . 
Detoxification of soluble arsenic by bioremediation is a commer- 
cially important goal 8 . Our goal was to use molecular breeding to 
evolve a very efficient pathway for arsenate detoxification, which 
would have potential for application to bioremediation of arsenate. 

Results and discussion 

Target pathway. The target DNA used was plasmid pGJ103, a 
5.5-kb pUC19-based plasmid containing a 2.7-kb arsenic resis- 
tance operon from Staphybcoccus aureus plasmid pI258 (including 
the genes arsR, arsB y and arsQ, which was expressed in Escherichia 
colt 9 . This operon confers resistance to the toxic oxyanions arsenate 
(pentavalent arsenic) and arsenite (trivalent arsenic) 7 . This plasmid 
was evolved by three recursive cycles of DNA shuffling, resulting in 
point mutagenesis and recombination, and selection for increased 
resistance to arsenate. 

Resistance evolution. The wild-type plasmid pGJ103 confers 
on E. coli TGI resistance to about 4 to 10 mM arsenate when 
grown on LB plates at 37°C for 24 h. Selection round 1, which was 
plated on 2 to 32 mM arsenate, yielded about 2,000 colonies growing 
at 16 mM arsenate. Selection round 2 was plated on 16 to 128 mM 
arsenate and yielded about 4,000 colonies growing at 64 mM arse- 
nate. Round 3 was plated at 64 to 256 mM arsenate and yielded 
about 1,500 colonies at 128 mM arsenate. Colonies were harvested 
from the plates with 128 mM arsenate and replated on 200 to 
400 mM arsenate. This gradual increase in resistance during the 
selection process suggests that more than one mutation is involved. 

Clone AC3. The largest colony from a plate with 400 mM 
arsenate from round 3 was found to grow in liquid culture con- 
taining up to 500 mM arsenate (Fig. 2B). This clone was called 
AC3. In addition to arsenate, resistance of AC3 was also increased 
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by fourfold to sixfold to arsenite (As0 2 ") salts (Fig. 2C), a toxic 
oxyanion to which resistance requires the ArsB membrane pump 
but not the arsenate reductase enzyme, ArsC This suggests muta- 
tional effects beyond the arsC gene, which only affects resistance 
to arsenate*. 

Integration. Because no plasmid DNA could be recovered from 
any of the selection round 3 clones, the ars operon needed to be 
recovered by PCR from total cellular DNA. Total cellular DNA of 
AC3, digested with Sad and SphI, was cloned into pUCl9 before 
sequencing. The operon could only be recovered from AC3 total 
cellular DNA with primers located at the inner ends of the repeats 
flanking the operon, which are the terminal portions of site- 
specific recombinase genes'. Primers located in the middle or at 
the outside of the repeats did not work, suggesting that in AC3 the 
ars operon had integrated into the chromosome via the 200-bp 
homologous regions. The PCR product containing the evolved 
operon recovered from chromosomal DNA was reinserted into the 
native pGJ103 plasmid by mixing with the original pGJ103 plas- 
mid DNA, fragmenting the mixture with DNasel, reassembling it 
by primersiess PCR, transformating it back into E. coli> and select- 
ing for resistance to 400 mM arsenate. Again, the operon appeared 
to have integrated into the chromosome because in clones resistant 
to 400 mM arsenate, no plasmid could be detected, and the operon 
could only be recovered by PCR of chromosomal DNA. 

Selection control without shuffling. As a control, a similar 
selection protocol with four cycles of plating on 2 to 32 mM arsen- 
ate followed by growth on liquid media was carried out without 
shuffling. The arsenate resistance remained unchanged at 8 mM, 
and the plasmid remained episomal. Therefore, some or all of the 
mutations that resulted from the DNA shuffling were required for 
the integration and the high-resistance phenotype. 

Mutations. DNA sequencing of the entire 2.3-kb ars operon 
of AC3 showed 13 base changes relative to the original sequence' 
(Fig. 3). Two silent mutations were found in arsR (T389C and 
T429C), and one was found past the C-terminus of arsC (G2469C). 
Seven silent mutations and three mutations leading to amino acid 
substitutions were located in arsB, the arsenite membrane pump. 

The amino acid changes resulted in substitution of hydropho- 
bic amino acids with more hydrophilic residues: mutation T1281C 
resulted in Leu232Ser, mutation T1317C resulted in Phe244Ser, 
and mutation T1853C resulted in Tyr423His (numbering as in 
reference 9). The other seven mutations, T961G, A976G, T1267C, 
A1402G, T1730C, T1819C, and T1844C, were silent and did not 
result in predicted amino acid changes. 

Improvement mechanism. Substitution of hydrophobic 
with hydrophilic amino acids in a heterologously expressed pro- 
tein was also found for the evolved green fluorescent protein 5 , 
where it was shown to confer 
improved protein folding. For 
the ArsB protein, which is an 
inner membrane protein whose 
topology has been experimen- 
tally mapped to 12 transmem- 
brane alpha-helical regions in 
the related plasmid R773 (ref. 
10), the explanation may also be 
improved folding. However, 
even from this structural model 
no hypotheses concerning the 
functional changes can be 
inferred. For a protein whose 
overexpression is toxic for 
cells' 0 , the mutations may have 
resulted in a less toxic protein 
and therefore allowed a higher 
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Figure 1 . DNA shuffling. A single sequence is randomly fragmented 
by DNasel, and the fragments are reassembled based on sequence 
similarity by primerless PCR. The diversity consists of the variable 
level of mutations that is introduced by the PCR reaction. After the 
first selection cycle, the mutations in the pool of selected sequences 
are partially unlinked by random fragmentation of the DNA. Following 
reassembly of the fragments, new combinations of mutations are 
formed within a conserved framework sequence, and a controlled 
level of new point mutations is formed as well. When coupled with 
selection or screening, this process aHows rapid accumulation of 
useful combinations of mutations (green) from multiple parental 
sequences, while at the same time removing detrimental combina- 
tions of mutations ( 
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Figure 3. DMA sequence characterization of the evolved operon. The 
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the three genes. 
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expression level resulting in the increased arsenate resistance. An 
alternative explanation for the increased arsenate resistance is an 
increased V mBX of ArsB for arsenate. 

Arsenate reductase activity. The activity of arsenate reductase 
by whole evolved AC3 cells was increased about 12-fold (Fig. 4) 
compared with the control strain with the native pGJ103 plasmid. 
This increase in whole cell reductase rate appeared to result from 
an increase in the rate of reduction (V) and not from an enhanced 
affinity of arsenate for the cells (K,,,; data not shown). This is con- 
sistent with the finding that the mutations occurred in the efflux 
transport protein (ArsB) and not in the arsenate reductase itself 
(ArsC). A possible explanation is better coupling between ArsB 
and ArsC, thereby increasing the arsenate reductase activity even 
though no mutations are present in the arsC protein. 

These results demonstrate that DNA shuffling can dramati- 
cally and rapidly improve the performance of a multigene 
operon, independently of a detailed understanding of the mecha- 
nism. For many applications, it will not be necessary to rational- 
ize the improvements mechanistically or a priori. While the 
shuffling process is simple, like traditional breeding, improve- 
ments can arise by a wide variety of complex or simple mecha- 
nisms, most of which would have been missed by rational design. 
We expect that access to a wide variety of mutational mecha- 
nisms will generally be important to obtain improvements reli- 
ably and rapidly. Some general mechanisms that are likely to be 
important for sequence evolution in nature, such as the integra- 
tion strategy we have observed, are not yet a standard part of our 
current molecular biology toolbox and therefore provide a 
refreshing view of evolutionary processes. 



Experimental protocol 

DNA shuffling. The shuffling of pGJ103 was performed as described', using 
400 to 1,500-bp DNA fragments, which were generated by DNasel (Sigma, 
St. Louis, MO) and reassembled by PCR without added primers and using 
a PCR program as described previously 3 . The PCR product was digested with 
BamHI and the 5.5-kb fragment was purified from an agarose gel, self-Hg- 
ated, and electroporated into E coli strain TGI. 

Selections. Cells were plated on LB plates containing a range of concen- 
trations of sodium arsenate (Sigma) and incubated at 37°C for 24 h. 
About 1,500 to 4,000 colonies from the plates with the highest arsenate 
levels were pooled and cultured in liquid LB broth with a similar level of 
arsenate. Plasmid DNA was prepared from this liquid culture and used for 
shuffling. 

Selection without shuffling. As a control, a similar selection protocol was 
carried out without shuffling. Four cycles of selection on plates with 2, 4, 
8, 16, and 32 mM arsenate were performed. DNA (1 u.g) was prepared from 
the most resistant 2,000 to 4,000 colonies and electroporated into TGI cells 
without shuffling. 

Cycling. The second and third shuffling were identical to the first, except 
that the cells were plated at increasingly higher arsenate levels. After selection 
round 3 the ars operon required recovery by PCR from total cellular DNA 
due to poor plasmid yield. A 2.3-kb fragment containing the entire ars 
operon but not the 200-bp repeats was recovered from total cellular DNA of 
AC3 by PCR amplification with oligonucleotide primers (Sad: 5'ACTC- 
CGTCCACCT£A£dCTTGGTATT ACACGTCAAACTA (3' A is base 187 
in reference 61 and SphI: 5'AACCCTCCCTC GCATGC AAT ATTACAA- 
GACAG ACAGTT [3' T is opposite base 2512 in reference 9]). The PCR 
product containing the ars operon from clone AC3 was digested with SacI 
and SphI and cloned into pUCl9. 

DNA sequencing. The entire 2.3 operon from AC3 and from AC5 was 
sequenced by fluorescent dideoxy termination and with an ABI model 373 
(Fig. 3). 

Arsenate resistance measurement. Cells were grown overnight in LB with 
0, 2, 10, or 128 mM arsenate, respectively and were diluted 10,000 times into 
LB with added oxyanions as indicated. Turbidity was measured after 16 h 
growth at 37°C. E. coli strain TGI (Pharmacia, Uppsala, Sweden) was used 
for all experiments. Sodium arsenate was used as a 2.5 M stock solution, neu- 
tralized to pH 7 with NaOH. 

Arsenate reduction measurement. The ability of £. coli constructs to 
detoxifiy arsenate was measured by an intact cell arsenatej-eduction assay 11 . 
Cells were grown as in Figure 1 and were washed and resuspended in 
methanol amine buffer* and exposed to 4 mM "As-arsenate. Samples were 
removed periodically and heated to 100°C, followed by centrifugation at 4°C, 
and samples were frozen. Radioactive "AsO* 1- was used as substrate, and 
"As-arsenate and "As-arsenite were separated by thin layer chromatography, 
followed by quantitation in an Ambis radioactive counter. 
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As part of an in vivo study of carbohydrate metabolism during 
development of Zea mays L. kernels, quantities of nucleotides and 
nucleotide sugars were measured in endosperm extracts from nor- 
mal, the single-mutant genotypes shrunken- T (shf), shrunken-2 
(s/i2), and brittle-1 (bt1), and the multiple-mutant genotypes 
shlbtl, sh2bt1, and sb1sb2bt1. Results showed that bt1 kernels 
accumulated more than 13 times as much adenosine 5' diphospho- 
glucose (ADP-CIc) as normal kernels. Activity of starch synthase in 
bt 1 endosperm was equal to that in endosperm extracts from nor- 
mal kernels. Thus the ADP-CIc accumulation in btl endosperm cells 
was not due to a deficiency in starch synthase. ADP-CIc content in 
extracts of shlbtl endosperms was similar to that in btl, but in 
extracts of the sh2bt1 mutant kernels ADP-Glc content was much 
reduced compared to btl (about 3 times higher than that in nor- 
mal). Endosperm extracts from sh1sh2bt1, kernels that are deficient 
in both ADP-CIc pyrophosphorylase (ACPase) and sucrose syn- 
thase, had quantities of ADP-CIc much lower than in normal ker- 
nels. These results clearly indicate that AGPase is the predominant 
enzyme responsible for the in vivo synthesis of ADP-CIc in btl 
mutant kernels, but Sue synthase may also contribute to the syn- 
thesis of ADP-CIc in kernels deficient in AGPase. 



A number of maize (Zea mays L.) endosperm mutants 
that affect the quantity and quality of carbohydrates in the 
endosperm, as well as kernel development and morphol- 
ogy, have been identified and extensively studied (Shan- 
non and Garwood, 1984). Mutants such as waxy (wx), amy- 
lose extender (ae), and sugary-1 affect the quantity and 
branching characteristic of kernel polysaccharides (Shan- 
non and Garwood, 1984). Several other mutants involve a 
defect in the metabolism of sugars, resulting in the accu- 
mulation of Sue and the reduction in starch. For example, 
shl kernels are deficient in the major Sue synthase enzyme 
(Chourey and Nelson, 1976), and shl and brittle-2 mutant 
kernels are both deficient in AGPase (Preiss, 1991). The 
genetic lesion of btl, another high-sugar/ low-starch mu- 
tant, is not as well defined. Early screening of developing 
btl kernels for the activity of various enzymes of carbohy- 
drate metabolism and starch biosynthesis failed to identify 



a specific enzyme lesion. Developing btl kernels, compared 
to normal, have been reported to be low in a starch gran- 
ule-bound phospho-oligosaccharide synthase (Pan and 
Nelson, 1985) and starch debranching enzyme but twice as 
high in AGPase (Doehlert and Kuo, 1990). 

Sullivan et al. (1991) reported the isolation and analy- 
sis of the Btl gene from maize and showed that the 
protein with greatest similarity to the Btl -encoded pro- 
tein is a yeast adenylate translocator. Li et al. (1992) 
showed that the in vitro translated Btl gene product 
could be imported into chloroplasts, where it was pro- 
cessed and localized to the inner envelope membrane. 
We have isolated amyloplast membranes from normal 
and btl kernels and showed that the four most abundant 
amyloplast membrane polypeptides (38-44 kD) from 
normal kernels were specifically recognized by antibod- 
ies raised against BT1 (Cao et al., 1995). In contrast, the 
four abundant BT1 immunoreactive polypeptides were 
missing from amyloplast membranes isolated from btl 
mutant kernels. Liu et al. (1992) reported that amylo- 
plasts isolated from young maize kernels effectively take 
up ADP-Glc for starch synthesis, but amyloplasts from 
btl are only 25% as active in ADP-Glc uptake and incor- 
poration into starch as amyloplasts from normal maize 
endosperm. The precise function of the Btl gene product 
is unknown, but it must be important to starch accumu- 
lation, since starch content of btl kernels is only about 
20% of that in normal maize endosperm (Tobias et al., 
1992). 

As part of a continuing in vivo /gene tic approach to the 
study of carbohydrate metabolism in normal and starch- 
deficient maize endosperm mutant genotypes, we discov- 
ered that btl kernels accumulate very high levels of ADP- 
Glc. In this paper we report the nucleotide composition of 
endosperm extracts from normal and mutant genotypes. 
Results of a study designed to determine the enzyme(s) 
responsible for the in vivo synthesis of ADP-Glc in btl 
kernels are also reported. 
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diphosphoglucose; UGPase, UDP-Glc pyrophosphorylase; UPTG, 
UDP-Glc:protein transglucosylase. 



835 



836 



Shannon et al. 



Plant Physio!. Vol. 110, 1996 



MATERIALS AND METHODS 

Plant Material 

The normal maize (Zea mays L.) inbred W64A and the 
endosperm mutant genotypes shl and bt 1 in a near isogenic 
W64A background were grown in the field at the Russell E. 
Larson Agricultural Research Farm (Center County, PA) 
during the summer of 1994. Additional plants for the single 
mutants btl, shl, and shl and the multiple mutants shlbtl 
and sh2btl, also in a near-isogenic W64A background, were 
grown in 20-L plastic pots containing a loose growing 
medium consisting of two parts peat, two parts perlite, and 
one part soil. The sh2 plants were grown in the spring of 
1993, and the other single-, double-, and triple- (shlsh2btl) 
mutant genotypes were started in the greenhouse in late 
spring of 1994 and then transferred outside the greenhouse 
for continued growth. High-intensity sodium lamps were 
used in the greenhouse to extend the daylength to 16 h. For 
the experiments comparing nucleotide levels in endosperm 
tissue containing different dosages of the Btl gene and 
different developmental ages (10 and 20 DPP), the plants 
were grown in the greenhouse during the spring of 1993. 
For the dosage experiment, cross-pollinations and recipro- 
cal cross-pollinations were made between W64A and btl. 
All ears were hand-pollinated and harvested 20 DPP (ex- 
cept for the developmental age experiment). For plants 
grown in 1993, kernels were cut from the cobs leaving 
approximately the lower one-fifth on the cob. The pericarp 
and embryo were removed from each kernel, and the en- 
dosperm was quickly frozen in liquid nitrogen and freeze 
dried. To reduce the time between ear removal from the 
plant and freezing, in 1994, the kernels were cut from the 
ear, quickly frozen in liquid nitrogen, and freeze dried 
prior to removing the embryos. The dried endosperms and 
pericarps were pulverized in a mortar and pestle, taking 
care not to finely pulverize the pericarp. Most of the peri- 
carp was removed by sifting the powdered endosperm 
through a 40-mesh stainless steel sieve (Cellector Tissue 
Sieve, E-C Apparatus Corp., Thomas Scientific, Swedes- 
boro, NJ). Although the "endosperm" samples from the 
kernels harvested in 1994 may have contained some peri- 
carp tissue, in our opinion the advantage of the increased 
speed of freezing the samples outweighed the disadvan- 
tage of the slight pericarp contamination. 

Identification of Double and Triple Mutants by 
Dot Blot and Embryo Culture 

To isolate the double-mutant genotypes, homozygous 
shl and sfi2 plants were pollinated with pollen from ho- 
mozygous btl plants. The shl, sh2, and btl plants were all 
in a near-isogenic W64A inbred background. To produce 
the triple-mutant genotype shlsh2btl, a shlsh2 double-mu- 
tant plant (the shlsh2 kernels, a gift from William Tracy, 
University of Wisconsin, Madison, were not isogenic with 
W64A) was pollinated with pollen from a homozygous btl 
plant. F t seeds resulting from these crosses were planted in 
20-L containers as described above, and the plants were 
grown in the greenhouse in the spring of 1994. The F x 
plants were self-pollinated and the F 2 ears were harvested 



20 DPP. Intact kernels were removed from the ears and 
surface sterilized for 15 nun in a 20% chlorine bleach solu- 
tion containing two drops of Tween 20 per 100 mL of 
solution. The kernels were rinsed in sterile water, and the 
upper one-third of at least 100 kernels from each ear was 
excised for immimoassay by tissue printing. For the immu- 
noassay, the cut surface of each endosperm was blotted 
onto two (for the double-mutant selections) or three (for 
the triple-mutant selections) nitrocellulose membranes (see 
below). The rest of each kernel, including the embryo, was 
placed in individual wells of a sterile EUSA plate. Each 
well contained one to two drops of sterile water. The ELISA 
plates containing the kernel pieces were closed and stored 
in the cold for about 24 h until the results of the dot blot 
immunoassay were complete. Care was taken to maintain 
the identity of the residual kernel pieces with the dot blots. 
Putative shlbtl kernels were identified by the absence of 
BT1 antibody-reacting proteins (the BT1 antibody was a 
gift from Thomas Sullivan, University of Wisconsin, Mad- 
ison) on one membrane and the absence of Sue synthase 
(SHI) antibody-reacting proteins (the SHI antibody was a 
gift from Prem Chourey, U.S. Department of Agriculture/ 
University of Florida, Gainsville) on a second membrane. 
Likewise, the shlbtl and shlshlbtl kernels were identified 
as those missing the SH2 (antibodies to the SH2 polypep- 
tide were a gift from L. Curtis Hannah, University of 
Florida, Gainsville) and BT1 polypeptides and those miss- 
ing the SHI, SH2, and BT1 polypeptides, respectively. The- 
oretically, one-sixteenth of the F 2 kernels on F a plants from 
the shl x btl or sh2 X btl crosses should be double mutants 
and one-sixty-fourth of the F 2 kernels on the shlshl X btl ¥ x 
plants should be the triple-mutant genotype. The embryos 
from 22 punitive shlbtl, 18 shlbtl, and 3 shlsh2btl kernels 
were excised and cultured in vitro according to the proce- 
dure of Smith (1992). The cultured embryos were allowed 
to germinate in the light at 22°C for 14 d. The established 
seedlings were then removed from the tubes and trans- 
planted to pots containing Fafard Canadian Growing Mix 
No. 2 (Conrad Fafard Inc., Agawam, MA). After 14 d of 
growth in the culture room, the seedlings were trans- 
planted to 20-L plastic pots and growth continued in the 
greenhouse and then outdoors. Each plant was self-polli- 
nated, and two-thirds of the ear were harvested 20 DPP for 
analysis. The genotype of each ear was confirmed by dot 
blot immunoassay and by enzymatic analyses, and the 
remaining one-third of each ear was allowed to develop to 
maturity. Of all of the plants tentatively identified as mul- 
tiple mutants, 19 of 22 were shlbtl, 9 of 18 were sh2btl, and 
1 of 3 was shlsh2btl. 

The method for tissue printing was adapted from the 
procedure described by Campillo (1992). Briefly, the nitro- 
cellulose membrane was soaked in 0.2 M CaCl 2 for 20 min 
with shaking and dried on 3-MM Whatman chromato- 
graphic paper. Then the membrane was printed with en- 
dosperm tissue and blocked in a tray for 1 to 2 h with the 
blocking buffer (5% [w/v] nonfat dry milk, 1% [w/vl BSA, 
1 M Gly, and 0.02% sodium azide in TTBS buffer [20 mM 
Tris-base, pH 7.5, 150 mM NaCl, and 0.05% Tween 20]) by 
gentle shaking. The blocking buffer was replaced with 
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TTBS buffer containing 1% nonfat dry milk, 0.02% sodium 
azide, and the primary antibody and gently shaken for 1 to 
2 h. After the printed membrane was washed twice with 
TTBS buffer for 5 min each time, it was incubated in the 
blocking buffer containing the secondary antibody for 1 to 
2 h with gentle shaking. The secondary antibody for SHI 
consisted of 10 /xL of goat anti-mouse IgG-alkaline phos- 
phatase conjugate (Sigma catalog No. A5153) in 10 mL of 
blocking buffer. For SH2 and BT1, 10 uL of goat anti-rabbit 
IgG-alkaline phosphatase conjugate (Sigma catalog No. 
A9919) was added to 10 mL of blocking buffer. The mem- 
branes were then washed twice with TTBS buffer for 10 
min each time and once with TBS (TTBS buffer without 
Tween 20) for 10 min. The reaction was then detected with 
0.015% (w/v) 5-bromo-4-chloro-3-indoyl phosphate and 
0.03% (w/v) nitroblue tetrazolium in AP buffer (0.1 m Tris 
base, 0.1 m NaCl, and 5 mM MgCl 2/ pH 9.5). The reaction 
was stopped by quickly washing the membrane with dis- 
tilled water. Finally, the membrane was sealed in a plastic 
bag and stored in a cool place for future reference. 

Enzyme Extraction and Assay 

Fifty milligrams of freeze-dried endosperm tissue were 
homogenized in 2 mL of extraction buffer (10 mM Tricine, 
pH 7.2, 1 mM DTT, 1 mM EDTA, 0.5 mM PMSF) and then 
centrifuged at 20,000^ for 20 min in the cold (4°C). The 
crude supernatant was used for the assay of AGPase, 
UGPase, soluble starch synthase, and SBE. The pellet was 
washed once with extraction buffer and then suspended in 
extraction buffer, and an aliquot was removed for assay of 
starch granule-bound starch synthase. Prior to the assay of 
Sue synthase, an aliquot of the supernatant was desalted by 
centrifugal filtration on Sephadex G-25 columns equili- 
brated with extraction buffer. The procedures described by 
Echeverria et al. (1988) were followed for assay of AGPase, 
UGPase, starch synthase (including soluble and granular 
bound), and SBE, except that the activity of SBE was mea- 
sured using nonradioactive Glc-l-P as a substrate and the 
released Pi was determined as described by Lanzetta et al. 
(1979). The Pi released in reactions without added phos- 
phorylase a was subtracted from that released in the com- 
plete reaction mixture. Sue synthase activity was measured 
in the synthetic direction according to the procedure of 
Guy et al. (1992). Soluble protein in the enzyme extracts 
was determined using the Bio-Rad protein assay based on 
the Bradford method. BSA was used as a standard. 

Metabolite Extraction 

Equal weights of pulverized, sifted endosperm from 
three shlbtl ears were combined to produce composite 
sample 1 and sifted endosperm from another three ears 
combined for composite sample 2. Likewise, for the two 
sh2btl composite samples, equal weights of pulverized, 
sifted endosperm from four and three ears were combined. 
Pulverized, sifted endosperm from a single ear was used 
for the shlshlbtl samples. Pulverized endosperm from sev- 
eral ears were combined for the composite samples of 
normal (W64A), btl, shl, and sh2. 



Duplicate 0.5-g samples of freeze-dried endosperm tis- 
sues from the single-mutant and triple-mutant genotypes 
and from the two composite samples of the double-mutant 
genotypes were homogenized in a 10-mL Potter-El vehj em- 
type homogenizer with a Teflon pestle (Thomas Scientific) 
with 3 mL of 0-8 M HC10 4 in a frozen slurry state. The ice 
slurry was obtained by mixing dry ice powder with the 
tissue and HC10 4 prior to homogenization. Then, the ho- 
mogenates were immediately centrifuged at 2,500# in the 
cold (4°C) for 10 min, and the pellets were washed twice 
with cold water (1 mL) each. The combined supernatants 
(5-6 mL) were collected in 40-mL conical centrifuge tubes 
and neutralized by addition of an equal volume of cold 0.5 
m tri-n-octylamine in 1,1,2-trichlorotrifluoroethane. The ex- 
tract was mixed vigorously for 30 s. Three layers formed 
during centrifugation at l,000g (4°C) for 5 min. The inter- 
mediate layer solidified during centrifugation, and the up- 
per aqueous layer was withdrawn from the tube. Three to 
4 mL of H 2 0 were added to the residual layers, and after 
the sample was mixed and centrifuged, the upper aqueous 
layer was removed and added to the initial aqueous ex- 
tract. This step effectively removes the perchlorate from the 
aqueous fraction and adjusts the pH to between 6.0 and 6.5 
(Heinrich and Rapoport, 1974). The combined aqueous lay- 
ers were freeze dried for 24 h. The dried extracts were 
dissolved in water, and any insoluble residues were re- 
moved by centrifugation at 16,000# for 10 min (Eppendorf 
centrifuge 5415C) at 4°C. The clarified supernatants were 
retained for metabolite analysis. In a prelirrtinary study we 
determined the recovery following extraction of added 
standards, and all metabolite data were corrected based on 
the percentage recovery of the individual metabolite. 



Measurement of Neutral Sugars and Starch 

Total reducing sugars and Sue in the extracts were de- 
termined by the Nelson test (Hodge and Hoereiter, 1962) 
and the anthrone test (Ashwell, 1957), respectively. To 
quantify starch granule number and quantity, the pellet 
remaining after metabolite extraction was suspended in 
water and aliquots were removed for counting and quan- 
titative measurement. For quantitative measurement, 
10-mL aliquots of the starch suspensions were gelatinized 
for 30 min in a boiling water bath. After adjustment to a 
known volume, 1-mL aliquots were added to an equal 
volume of 1.0 M sodium acetate, pH 4.8, containing 10 units 
of amyloglucosidase. Following an overnight incubation at 
37°C, the digests were heated in a boiling water bath for 1 
min to inactivate the enzyme and centrifuged at 20,000g for 
10 min. Glc in the supernatants was determined by the 
Nelson test (Hodge and Hoereiter, 1962). To determine the 
number of starch granules, aliquots of the starch suspen- 
sion were appropriately diluted and the number of gran- 
ules was determined using a Spencer Bright Line Improved 
Neubauer hemacytometer (0.1 mm deep) (Hausser Scien- 
tific, Blue Bell, PA). The numbers of starch granules in 
random 0.004-mm 3 areas of the hemacytometer were 
counted using a microscope, and the total number of starch 
granules per sample extract was calculated. 
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Measurement of Nucleotides and Nucleotide Sugars 

The major nucleotides (UMF, UDP, UTP, AMP, ADP, 
and ATP) and nucleotide sugars (UDP-Glc and ADP-Glc) 
were separated and quantified by ion-paired HPLC using 
a reversed-phase Adsorbosphere Nucleotide-Nucleoside 
7-/un column (250 X 4.6 mm; Alltech Associates, Deerfield, 
IL). The mobile phases contained: A, 20 mM KH 2 P0 4 and 5 
mM tetrabutylammonium phosphate, pH 5.0; B, 100% 
HPLC-grade methanol. The flow rate was 1.5 mL/min. The 
column was equilibrated with 95% A and 5% B, and then 
the sample was injected and a linear gradient program 
from 5% B to 50% B was processed. The 50% B gradient 
condition was reached in 20 irtin, and elution continued at 
the same condition for another 6 min. Each nucleotide was 
identified by co-migration with commercial standards, and 
each peak was confirmed by matching the spectrum of each 
nucleotide peak against that of the standard compounds. 
The quantity of each nucleotide and nucleotide sugar was 
determined by comparing the sample peak areas to areas 
from a standard curve. A Waters Associates HPLC system 
consisting of dual model 510 pumps, Waters 712 WISP 
autoinjector, and Waters 996 Photodiode array detector 
was used for nucleotide analysis. A computer with Waters 
Millennium 2010 Chromatograph Manager software was 
connected to the HPLC system to control all of the pro- 
grams for acquiring and processing the data and calculat- 
ing the results. The A 260 was extracted from the photodiode 
array for quantifying each nucleotide and nucleotide sugar. 
The relative purity of each peak was determined by com- 
paring the sample nucleotide spectra with standard nucle- 
otide spectra previously stored in the computer. Experi- 
ments to determine the effects of Btl gene dosages and 
developmental ages on nucleotide composition were com- 
pleted earlier using an older Waters HPLC system, which 
consisted of Waters dual model 501 pumps, Waters auto- 
mated gradient controller, Waters model 441 UV absor- 
bance detector (254 nm, 0.2 absorbance units full scale), and 
the 740 data module. 

RESULTS 

Recovery Study for Extraction Procedure 

In the study reported by Tobias et al. (1992), we encoun- 
tered difficulties associated with excess salt following neu- 
tralization of the HC10 4 extract with K 2 C0 3 . The residual 
KCIO4 salt in the concentrated extracts limited the size of 
aliquot that could be used for enzymatic analysis of the 
hexose-Ps and triose-Ps. Thus, prior to beginning the 
present study we compared several extraction and neutral- 
ization methods. The procedure used in this study was 
judged to be superior to that used previously (Tobias et al., 
1992). The HPLC protocol effectively separated at least 15 
compounds that were identified and 5 unidentified peaks 
(Fig. 1). The identities of the nucleotides were determined 
by comigration with known standards and confirmed by 
matching their spectra to those of standards. AMP eluted 
as the middle peak between two unidentified peaks. The 
larger unknown peak c has a maximum A at 247 nm and its 
spectrum did not match any of the standard nucleotide 



spectra. The UDP, ADP-Glc, and ADP peaks in W64A each 
had leading shoulders. When these nucleotides were quan- 
tified, the integrator was set to exclude the area under the 
shoulders. The elution time and spectrum of minor peak d 
matched that of GTP, but since it was very low in all 
genotypes, the quantities of GTP are not reported. Un- 
known peak e, a contaminant in the eluant, had a maxi- 
mum A at 280 nm. Recoveries for the uridine and adeno- 
sine nucleotides, nucleotide sugars, and NAD ranged 
between 91 and 112% (Table I). Nucleotide quantities re- 
ported were corrected by the respective percentage of re- 
covery. Although data concerning the contents of hex- 
ose-Ps and triose-Ps are not included in this paper, we note 
that improved recoveries of these metabolites were ob- 
tained using this extraction procedure. For example, with 
the current procedure we obtained 80 and 71% recoveries 
of added dihydroxyacetone phosphate and glyceralde- 
hyde-3-P, respectively, a significant improvement over the 
66 and 28% recoveries of these metabolites reported by 
Tobias et al. (1992). 

Enzyme Activities 

Preliminary selections of multiple-mutant genotypes 
were based on immunoassays of endosperm tissue prints 
from the segregating F 2 ears harvested 20 DPP (see "Ma- 
terials and Methods"). Embryos from putative multiple- 
mutant genotypes were "gernunated" in vitro, and the 
resulting plants were self-pollinated and ears sampled 20 
DPP. Final confirmations of the multiple-mutant genotypes 
were based on the absence of the endosperm-specific BT1 
polypeptide and activities of AGPase and Sue synthase in 
extracts of the freeze-dried 20-DPP endosperms. Dot blots 
of extracts from normal, shl, and shl were positive for 
polypeptides reacting with antibodies raised against BT1, 
but BT1 antibody-reacting proteins were completely absent 
from all blots of fctl-containing genotypes (data not 
shown). As expected, Sue synthase and AGPase activities 
were very low in all shl- and s/i2-containing genotypes, 
respectively (Table II). Soluble protein and activities of 
UGPase, starch synthase (soluble plus granule bound), and 
total SBE in the mutant genotypes were similar to those in 
normal kernels. 

Kernel Characteristics of Normal and Single- and Multiple- 
Mutant Genotypes 

Mature kernel phenotypes of the multiple-mutant geno- 
types are indistinguishable from btl or shl single-mutant 
kernels. Kernels of the double- and triple-mutant geno- 
types are viable. In this study we did not measure the 
endosperm dry weights of the various genotypes. How- 
ever, we did determine the dry weights per endosperm of 
these same genotypes grown in the greenhouse in the late 
winter and spring of 1995 and harvested 20 DPP. The 
average dry weights per endosperm for normal and shl 
were similar at 42.3 ± 5.9 and 41.8 ± 2.5 mg, respectively. 
Weights per endosperm of shl, btl, shlbtl, and shlbtl were 
similar at 34.6 ± 2.1, 35.0 ± 2.7, 34.7 ± 7.5, and 32.5 ± 3.9 
mg, respectively. Although these weights per endosperm 
are from plants grown in a different year from those ana- 
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Figure 1. HPLC separation of nucleotide stan- 
dards (top, Std.), endosperm extracts from the 
normal W64A inbred (middle), and endosperm 
extracts from the bt1 mutant (bottom). A 2S0 is 
plotted. Peaks are numbered according to the 
order of separation of the following standards: 1 , 
uridine; 2, adenine; 3, UMP; 4, adenosine; 5, 
NAD; 6, AMP; 7, UDP-CIc; 8, UDP; 9, NADP; 
1 0, ADP-Clc; 1 1 , ADP; 1 2, NADH; 1 3, UTP; 1 4, 
ATP; 15, NADPH. Peaks a to e are unidentified. 
AU, Absorbance units of A 260 . 
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lyzed in this study, we are safe in assuming that the rela- 
tive weights per endosperm of normal (W64A) and the 
mutant genotypes will be similar, and thus these weights 
may be used to estimate content per endosperm. Sue and 
total reducing sugar contents in endosperms of the multi- 
ple-mutant genotypes were very high (Table III) as previ- 
ously reported for the shl and btl genotypes (Tobias et al., 
1992). Normal endosperms contained 2.14 X 10 6 starch 
granules mg"" 1 freeze-dried tissue (90.5 X 10 6 / endosperm) 
with an average of 293 pig starch 10~ 6 granules (Table III). 
The shl endosperm contained about twice as many smaller 
starch granules as normal (183.1 X 10 6 / endosperm), 
whereas the sh2, btl, shlbtl, sh2btl, and shlsh2btl en- 



dosperm samples contained more than 4 times as many 
very small granules (Table 111). The number of starch gran- 
ules per endosperm for shl, btl, shlbtl, and sh2btl were 
estimated to be 604.5 X 10 6 , 466.2 X 10 6 , 370.2 X 10 6 , and 
312.7 X 10 6 , respectively. 

Nucleotide Quantity in Normal (W64A) Endosperm Cells 

In normal endosperms the sum of uridine nucleotides 
was 1. 7-fold higher than that of adenosine nucleotides. 
UTP was the highest of all nucleotides, and it was 2.4-, 1.8-, 
and 1.2-fold higher than UMP, UDP, and UDP-Glc, respec- 
tively (Fig. 2). Likewise, ATP was about 2.4-, 1.7-, and 
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Table I. Metabolite levels in normal (W64A) endosperm (20 DPP) 
and the percentage recovery of added standards 

One-half gram samples of dry, pulverized endosperm from W64A 
kernels were extracted, and nucleotides were measured as described 
in "Materials and Methods." A mixture of nucleotides containing the 
indicated quantities were added to a second set of 0.5-g endosperm 
samples prior to extraction. Data are the averages ± sd of two to 
eight extracts. 



Metabolite 


Endosperm 
Extract 


Standard 
Added 


Standard 
Recovered 




nmole/g dry 


nmot/g dry wt 


% 






wt 








AMP 


452 ± 49 


400 


106 ± 


40 


ADP 


651 ± 93 


600 


112 ± 


5 


ATP 


1097 ± 62 


1200 


91 ± 


5 


ADP-Glc 


596 ± 29 


600 


97 ± 


3 


UMP 


672 ± 57 


800 


93 ± 


5 


UDP 


930 ± 62 


1000 


105 ± 


2 


UTP 


1636 ±92 


1600 


97 ± 


5 


UDP-CIc 


1416 ± 233 


1400 


101 ± 


11 


NAD 


391 ± 22 


400 


98 ± 


7 



1.8-fold higher than AMP, ADP, and ADP-Glc, respec- 
tively. The quantity of NAD" 1 " was less than AMP (Fig. 2). 
The quantities of NADP" 1 ", NADH, NADPH, cytosine 
nucleotides, and guanidine nucleotides were either very 
low or below the level of detection. 

Nucleotide Quantity in Endosperm Cells of Single Mutants 

All mutant data are presented as percentages of nucle- 
otide quantities in normal endosperm. The most striking 
effect of any of the three mutant genotypes was a 13-fold 
accumulation of ADP-Glc in the btl mutant endosperm 
(Fig. 3). ADP-Glc per endosperm in the normal genotype 
was estimated to be 25.7 ±1.3 nmol/ endosperm, whereas 
contents per endosperm in btl, shl, and shl were 271.1 ± 
31.6, 12.7 ± 2.4, and 34.7 ± 4.0 nmol, respectively. The btl 
endosperm extract was 2-fold higher in ATP and NAD and 
3-fold higher in UDP-Glc but was lower in UMP, UDP, and 
UTP than extracts from normal endosperms (Fig. 3). Ex- 
tracts from shl endosperms were 4-fold higher in UDP-Glc 
but substantially lower than normal in UMP, UTP, and 
ADP-Glc. ADP, ATP, and NAD in shl were higher than in 
normal endosperm. The content of nucleotides in shl en- 
dosperm extracts was similar to that in normal endosperm 
with only ADP and ATP being substantially higher (Fig. 3). 



The Effects of Btl Dosage and Developmental Ages on the 
Quantity of Nucleotides 

With each increase in the number of recessive btl alleles, 
there was an increase in quantity of ADP-Glc and UDP-Glc, 
but the greatest increase occurred in homozygous recessive 
kernels (Table IV). All of the nucleotides in endosperms of 
W64A and btl 10 DPP were much higher than those 20 DPP 
except for the ADP-Glc in btl. Extracts from 10-DPP btl 
endosperms contained 5.5 times more ADP-Glc than their 
normal counterparts, and by 20 DPP the ADP-Glc content 
had increased to a level more than 17 times higher than that 
in normal endosperms (Table IV). 

Nucleotide Quantity in Endosperm Cells of Multiple- 
Mutant Genotypes 

To determine whether Sue synthase or AGPase is respon- 
sible for the in vivo synthesis of ADP-Glc in btl, we measured 
the nucleotides in extracts of shlbtl, shlbtl, and shlshlbtl 
endosperms. ADP-Glc in extracts of the shlbtl double-mutant 
endosperms (deficient in Sue synthase) was similar to that in 
btl (11.5-fold higher than normal), but in extracts of the shlbtl 
double-mutant endosperms (deficient in AGPase) it was only 
about 3 times higher man that in normal endosperms (Fig. 4). 
Extracts of the shlshlbtl triple-mutant endosperms contained 
less than half as much ADP-Glc as extracts from normal 
endosperms. ADP-Glc contents per endosperm for shlbtl and 
shlbtl were estimated to be 239.6 ± 13.0 and 63.0 ± 0.7 nmol, 
respectively. If the kernel weight of the triple-mutant geno- 
type is assumed to be the same as the average weight of the 
double-mutant genotypes, then the ADP-Glc content of 
shlshlbtl endosperms would be 9.4 ± 0.1 nmol/endosperm. 
By comparison, ADP-Glc contents per endosperm in normal 
and btl endosperms were 25.7 and 271.1 nmol, respectively. 
UDP-Glc in extracts from all multiple-mutant genotypes was 
approximately ^fold higher than in extracts from normal 
endosperms. The other uridine nucleotides were similar to or 
lower than in normal endosperms and, except for ADP-Glc in 
shlshlbtl, the adenosine nucleotides and NAD were equal to 
or higher than those in normal endosperm extracts (Fig. 4). 

DISCUSSION 

The mature kernel pheno types of btl and shl are essen- 
tially identical, and the starch, neutral sugars, and hexose-P 
composition of extracts from developing btl and shl are 



Table II. Protein and enzyme activities in normal and mutant maize endosperm 


Genotype Protein AGPase UGPase 


Starch 
Synthase -1 


SBE 


Sue Synthase 



Normal 
shl 
sh2 
btl 

shlbtl 
sh2bt1 
sh1sh2bt1 



fxg mg ' dry wt 
46.6 
48.5 
53.6 
49.8 
53.6 
59.9 
57.9 



10.3 ±0.6 
12.3 ± 1.7 
1.8 ± 0.1 

12.1 ± 0.3 

15.2 ± 2.9 
1.8 ± 0.5 
1.4 ± 0.2 



55.0 ± 16.4 
52.9 ± 20.2 
53.9 ± 10.4 
35.2 ± 0.2 
42.4 ± 0.3 

55.1 ± 3.6 
44.6 ± 4.7 



nmol min 1 mg ' 
0.7 ± 0.0 
1.1 ± 0.0 

1.7 ± 0.0 
1.6 ± 0.1 
1.6 ± 0.2 
1.6 ± 0.1 

1.8 ± 0.2 



dry wt 



132 
177 
221 
179 
194 
222 
207 



1 ± 23.2 
.8 ± 37.6 
9 ± 37.9 

2 ± 45.1 

4 ± 25.7 

3 ± 33.5 

5 ± 39.9 



22.2 ± 7.9 
1.6 ± 0.3 

21.9 ± 4.0 

16.3 ± 5.8 
1.3 ± 0.2 

19.5 ± 0.4 
0.7 ± 0.1 



a Starch Synthase, Sum of soluble and starch granule-bound activities. 
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Table III. Neutral sugars and starch granule number and quantity 
in extracts from 20-DPP normal (WS4A) and mutant genotypes 

Sugar data for the double mutants are the means ± se of duplicate 
extractions from the two composite samples, and data for the other 
genotypes are means ± SE of duplicate extractions from composite 
samples. 



Genotype 


Sue 


Reducing 
Sugars 


Starch Granule 
Number 


Starch 
Content 




nmol/mg dry wt 


Hf/mg dry wt 


fig/10 6 










granules 


W64A 


129.7 ± 8.2 


20.0 ± 0.6 


2.14 


293 


shl 


674.1 ± 9.8 


49.2 ± 1.8 


4.38 


no 


sh2 


825.1 ± 16.0 


37.0 ± 0.9 


17.47 


19 


btl 


1114.4 ± 23.0 


48.0 ± 0.9 


13.32 


19 


shlbtf 


1077.0 ± 58.6 


48.8 ± 3.7 


10.67 


21 


sh2bt1 


981.8 ± 44.1 


39.5 ± 2.4 


9.62 


25 


sh1sh2bt1 


1198.8 ± 42.2 


67.2 ± 0.9 


9.36 


20 



very similar (Tobias et al., 1992). In contrast, ADP-GIc 
content in shl is reduced relative to normal, whereas that in 
btl is 13- to 17-fold higher than in normal endosperm 
extracts (Fig. 3; Table IV). Quantities of the other nucleo- 
tides in sh2 and btl were similar. Two enzymes, AGPase 
and Sue synthase, are capable of synthesizing ADP-GIc in 
vitro. The sfi2 mutant is deficient in AGPase (Tsai and 
Nelson, 1966; Dickinson and Preiss, 1969), and shl is defi- 
cient in Sue synthase (Chourey and Nelson, 1976). To de- 
tennine which enzyme is responsible for the in vivo syn- 
thesis of ADP-GIc in btl kernels, we produced the multiple- 
mutant genotypes shlbtl, shlbtl, and shlsh2btl and 
determined the content of the various nucleotides. ADP- 
GIc in endosperm extracts of the shlbtl double mutant was 
similar (11.5-fold higher than normal) to that in btl, but the 
endosperm extract from sh2btl was only about 3 times 
higher in ADP-GIc than normal endosperm extracts (Fig. 
4). Extracts of the shlsh2btl triple-mutant endosperms con- 
tained less than half as much ADP-GIc as extracts from 
normal endosperms. We conclude from these results that 
AGPase is the predominant enzyme responsible for the in 




NUCLEOTIDES 

Figure 2. Quantity of nucleotides in extracts from normal en- 
dosperms (W64A). Data are the means ± se of four extractions. 
UDPG, UDP-GIc; ADPC, ADP-GIc; DW, dry weight. 
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Figure 3. Quantity of nucleotides in endosperm extracts from the 
single-mutant genotypes btl, sh2 f and shl presented as percentages 
of the quantity in normal endosperms. UDPG, UDP-GIc; ADPG, 
ADP-GIc. 



vivo synthesis of ADP-GIc in btl mutant kernels. However, 
it appears that Sue synthase may also play a role in the in 
vivo synthesis of ADP-GIc, because the extract of shlsh2btl 
endosperm, which is deficient in both AGPase and Sue 
synthase, was much lower in ADP-GIc than extracts of 
sh2btl or normal endosperms. 

AGPase has been reported to be localized in the amylo- 
plast stroma of potato tubers (Kim et al., 1989) and maize 
endosperms (Miller and Chourey, 1995) and in the stroma 
of chloroplasts (Preiss, 1991). Localization of AGPase in 
plastids is generally accepted (Okita, 1992; Smith et al., 
1995). Extracts of btl endosperms contain activities of the 
enzymes for synthesis (AGPase) and utilization (starch 
synthase) of ADP-GIc that are equal to or higher than those 
in extracts from normal endosperms (Table II). However, 
starch accumulation in btl kernels is only 20% of that in 
normal kernels (Tobias et al., 1992). If ADP-GIc is synthe- 
sized in the amyloplast stroma by a plastid-localized 
AGPase and the activity of starch synthase, another plas- 
tid-localized enzyme, is not limiting, then why does the 
ADP-GIc accumulate rather than being utilized in the syn- 
thesis of starch? Synthesis of starch by starch synthase 
requires the presence of an oligosaccharide primer in ad- 
dition to ADP-GIc (Preiss, 1991). UPTG (EC 2.4.1.112) is a 
specialized protein in potato tubers (Lavintman et al., 1974) 
and maize endosperm (Cura et al., 1994; Rothschild and 
Tandecarz, 1994; Tandecarz et al., 1995), which functions in 
the initiation and elongation of the oligosaccharide primer 
required for de novo synthesis of starch (Cura et al., 1994; 
Tandecarz, et al., 1995). Maize UPTG is a multimeric pro- 
tein composed of identical 38-kD subunits. The nuclear- 
encoded BT1 polypeptides (Sullivan et al., 1991) are of a 
similar size (39-44 kD) (Cao et al., 1995). BT1 polypeptides 
have been shown by immunocytochemical localization 
(Sullivan and Kaneko, 1995) and by studies of isolated 
amyloplast membranes (Cao et al., 1995) to be specifically 
localized to the amyloplast membranes of developing 
maize endosperm. BT1 accounts for about 40% of the 
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Table IV. The effects of Btl gene dosage and kernel age on the quantity of nucleotides in maize endosperm 

Total endosperm (0.5 g dry weight) was extracted and nucleotides measured as described in "Materials and Methods/' Values are means ± 
se of two replications. 



Endosperm 
Genotype 


Kernel 
Age 


ADP 


ATP 




ADP-CIc 


UMP 


UDP 


UTP 




UDP-CIc 


NAD 




DPP 










nmol/g dry wt 










BtlBtlBtl 


20 


456 ± 26 


1525 ± 


103 


482 ± 8 


428 ± 27 


924 ± 70 


1425 ± 


139 


1232 ± 80 


547 ± 33 


BtlBtlbtl 


20 


470 ± 10 


1738 ± 


6 


518 ± 2 


701 ± 12 


984 ± 1 


1400 ± 


27 


1380 ± 6 


590 ± 2 


Btlbtlbtl 


20 


408 ± 1 


1695 ± 


24 


581 ± 12 


568 ± 27 


906 ± 16 


1435 ± 


41 


1432 ± 24 


543 ± 10 


btlbtlbtl 


20 


498 ± 20 


2273 ± 


120 


8447 ± 294 


348 ± 4 


654 ± 24 


315 ± 


20 


3108 ± 120 


1212 ± 39 


Bt1Bt1Bt1 


10 


1 204 ± 50 


4323 ± 


96 


709 ± 6 


1058 ±11 


1642 ± 26 


4588 ± 


80 


3514 ± 124 


690 ± 6 


btlbtlbtl 


10 


1252 ± 98 


4800 ± 


206 


■ 3932 ± 185 


1026 ±49 


1692 ± 62 


4027 ± 


177 


4384 ± 180 


1022 ±43 



polypeptides associated with amyloplast membranes from 
normal endosperms but is missing from amyloplast mem- 
branes isolated from btl mutant kernels (Cao et al., 1995). A 
possible explanation for the ADP-Glc accumulation in btl 
kernels is that Btl encodes maize UPTG and in its absence 
de novo starch synthesis is reduced because of a deficiency 
in oligosaccharide primer. Although this is an attractive 
possibility, it was not supported by recent evidence from 
Tandecarz's laboratory. They showed that extracts from 
endosperms of both W64A (wild type) and btl contained 
proteins that reacted with antibodies raised against the 
38-kD potato tuber UPTG protein (A. Rothschild, S. Bocca, 
and J. Tandecarz, unpublished results). In addition, our 
observation of the large number of small starch granules in 
btl endosperms does not support the suggestion that Btl 
encodes synthesis of UPTG. 

A second possible explanation for the high ADP-Glc in 
btl endosperm cells is that the ADP-Glc accumulates in a 
compartment physically separated from starch synthase. 
Since starch synthase is localized in the plastid stroma 
(Echeverria et al., 1988), it follows that in btl endosperm 
cells ADP-Glc accumulates in the cytosol. Kleczkowski et 
al. (1991) and Hannah et al. (1993) suggested that cereal 
endosperms may contain AGPase isozymes in the cytosol 
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Figure 4. Quantity of nucleotides in endosperm extracts from btl 
and the multiple-mutant genotypes shlbtl, sh2btl, and sh1sh2bt1 
presented as percentages of the quantity in normal endosperms. 
UDPG, UDP-GIc; ADPC, ADP-Glc. 



as well as in the amyloplast stroma, and Viiland and 
Kleczkowski (1994) detailed the evidence in support of this 
suggestion. A possible explanation is that BT1 is an adeny- 
late rranslocator, and in its absence (i.e. in btl kernels) 
ADP-Glc synthesized in the cytosol is unable to enter the 
amyloplasts and thus it accumulates. This suggestion is 
supported by the observation that amyloplasts isolated 
from immature btl endosperm were only 25% as effective 
in the uptake and incorporation of ADP-Glc into starch as 
amyloplasts isolated from normal or wx endosperms (Liu 
et al., 1992). In addition, Sullivan et al. (1991) reported that 
the Btl -encoded protein is most closely related to a yeast 
adenylate rranslocator. 

A third possible explanation for the ADP-Glc accumula- 
tion in .the cytosol of btl kernels is that BT1 may be the 
protein transporter required for the transfer and processing 
of the AGPase subunits into the amyloplast stroma. Ac- 
cording to this suggestion the AGPase subunit transfer 
would not occur in btl mutant endosperm cells, and in- 
stead the AGPase subunit(s) would remain in the cytosol 
where it is processed into active AGPase. Ballicora et al. 
(1995) reported that the small subunit of potato AGPase 
expressed in Escherichia coli, without the large subunit, 
exhibited high enzyme activity, but the regulatory proper- 
ties differed from those of the heterotetromeric enzyme 
expressed in E. coli. If we assume that the maize amyloplast 
membrane is unable to transfer ADP-Glc, as in potato 
rubers (Stark et al., 1992), then accumulation of ADP-Glc in 
the cytosol of btl endosperm cells would occur. Further 
studies are needed to conclusively detenrune the function 
of BT1. 
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